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1 


Fundamentals 


Combinatorics is often described briefly as being about counting, and indeed counting is 
a large part of combinatorics. As the name suggests, however, it is broader than this: it 
is about combining things. Questions that arise include counting problems: “How many 
ways can these elements be combined?” But there are other questions, such as whether a 
certain combination is possible, or what combination is the “best” in some sense. We will 
see all of these, though counting plays a particularly large role. 

Graph theory is concerned with various types of networks, or really models of networks 
called graphs. These are not the graphs of analytic geometry, but what are often described 
as “points connected by lines”, for example: 


The preferred terminology is vertex for a point and edge for a line. The lines need not 
be straight lines, and in fact the actual definition of a graph is not a geometric definition. 
The figure above is simply a visualization of a graph; the graph is a more abstract object, 
consisting of seven vertices, which we might name {v1,...,v7}, and the collection of pairs 
of vertices that are connected; for a suitable assignment of names v; to the points in 
the diagram, the edges could be represented as {v1, v2},{v2, u3},{v3, v4},{v3, Us },{U4, Us }, 


{us, V6 },{V6, U7}. 
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1.1 EXAMPLES 


Suppose we have a chess board, and a collection of tiles, like dominoes, each of which is the 
size of two squares on the chess board. Can the chess board be covered by the dominoes? 
First we need to be clear on the rules: the board is covered if the dominoes are laid down so 
that each covers exactly two squares of the board; no dominoes overlap; and every square 
is covered. The answer is easy: simply by laying out 32 dominoes in rows, the board can 
be covered. To make the problem more interesting, we allow the board to be rectangular 
of any size, and we allow some squares to be removed from the board. What can be say 
about whether the remaining board can be covered? This is such a board, for example: 


What can we say? Here is an easy observation: each domino must cover two squares, so 
the total number of squares must be even; the board above has an even number of squares. 
Is that enough? It is not too hard to convince yourself that this board cannot be covered; 
is there some general principle at work? Suppose we redraw the board to emphasize that 
it really is part of a chess board: 


Aha! Every tile must cover one white and one gray square, but there are four of the 
former and six of the latter, so it is impossible. Now do we have the whole picture? No; 
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for example: 


The gray square at the upper right clearly cannot be covered. Unfortunately it is not easy 
to state a condition that fully characterizes the boards that can be covered; we will see 
this problem again. Let us note, however, that this problem can also be represented as 
a graph problem. We introduce a vertex corresponding to each square, and connect two 
vertices by an edge if their associated squares can be covered by a single domino; here is 
the previous board: 


Here the top row of vertices represents the gray squares, the bottom row the white squares. 
A domino now corresponds to an edge; a covering by dominoes corresponds to a collection 
of edges that share no endpoints and that are incident with (that is, touch) all six vertices. 
Since no edge is incident with the top left vertex, there is no cover. 

Perhaps the most famous problem in graph theory concerns map coloring: Given a 
map of some countries, how many colors are required to color the map so that countries 
sharing a border get different colors? It was long conjectured that any map could be 
colored with four colors, and this was finally proved in 1976. Here is an example of a small 
map, colored with four colors: 


Typically this problem is turned into a graph theory problem. Suppose we add to each 
country a capital, and connect capitals across common boundaries. Coloring the capitals so 


10 


that 
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no two connected capitals share a color is clearly the same problem. For the previous 


map: 


Any 


graph produced in this way will have an important property: it can be drawn so that 


no edges cross each other; this is a planar graph. Non-planar graphs can require more 


than four colors, for example this graph: 


This 
each 


is called the complete graph on five vertices, denoted Ks; in a complete graph, 
vertex is connected to each of the others. Here only the “fat” dots represent vertices; 


intersections of edges at other points are not vertices. A few minutes spent trying should 


convince you that this graph cannot be drawn so that its edges don’t cross, though the 


number of edge crossings can be reduced. 


Exercises 1.1. 


1. 


Explain why an m x n board can be covered if either m or n is even. Explain why it cannot 
be covered if both m and n are odd. 


Suppose two diagonally opposite corners of an ordinary 8 x 8 board are removed. Can the 
resulting board be covered? 


Suppose that m and n are both odd. On an m x n board, colored as usual, all four corners 
will be the same color, say white. Suppose one white square is removed from any location 
on the board. Show that the resulting board can be covered. 


Suppose that one corner of an 8 x 8 board is removed. Can the remainder be covered by 
1 x 3 tiles? Show a tiling or prove that it cannot be done. 
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5. Suppose the square in row 3, column 3 of an 8 x 8 board is removed. Can the remainder be 
covered by 1 x 3 tiles? Show a tiling or prove that it cannot be done. 


6. Remove two diagonally opposite corners of an m x n board, where m is odd and n is even. 
Show that the remainder can be covered with dominoes. 


7. Suppose one white and one black square are removed from an n x n board, n even. Show 
that the remainder can be covered by dominoes. 


8. Suppose an nxn board, n even, is covered with dominoes. Show that the number of horizontal 
dominoes with a white square under the left end is equal to the number of horizontal dominoes 
with a black square under the left end. 


9. In the complete graph on five vertices shown above, there are five pairs of edges that cross. 
Draw this graph so that only one pair of edges cross. Remember that “edges” do not have 
to be straight lines. 


10. The complete bipartite graph K3,3 consists of two groups of three vertices each, with all 
possible edges between the groups and no other edges: 


Draw this graph with only one crossing. 


1.2 COMBINATIONS AND PERMUTATIONS 


We turn first to counting. While this sounds simple, perhaps too simple to study, it is 
not. When we speak of counting, it is shorthand for determining the size of a set, or more 
often, the sizes of many sets, all with something in common, but different sizes depending 
on one or more parameters. For example: how many outcomes are possible when a die is 
rolled? Two dice? n dice? As stated, this is ambiguous: what do we mean by “outcome” ? 
Suppose we roll two dice, say a red die and a green die. Is “red two, green three” a 
different outcome than “red three, green two”? If yes, we are counting the number of 
possible “physical” outcomes, namely 36. If no, there are 21. We might even be interested 
simply in the possible totals, in which case there are 11 outcomes. 

Even the quite simple first interpretation relies on some degree of knowledge about 
counting; we first make two simple facts explicit. In terms of set sizes, suppose we know 
that set A has size m and set B has size n. What is the size of A and B together, that is, 
the size of AU B? If we know that A and B have no elements in common, then the size 
AUB is m+n; if they do have elements in common, we need more information. A simple 
but typical problem of this type: if we roll two dice, how many ways are there to get either 
7 or 11? Since there are 6 ways to get 7 and two ways to get 11, the answer is 6 + 2 = 8. 
Though this principle is simple, it is easy to forget the requirement that the two sets be 
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disjoint, and hence to use it when the circumstances are otherwise. This principle is often 
called the addition principle. 

This principle can be generalized: if sets A; through A, are pairwise disjoint and have 
sizes M1,...™Mn, then the size of Ay U--- UA, = >>;_, m;. This can be proved by a simple 
induction argument. 

Why do we know, without listing them all, that there are 36 outcomes when two dice 
are rolled? We can view the outcomes as two separate outcomes, that is, the outcome 
of rolling die number one and the outcome of rolling die number two. For each of 6 
outcomes for the first die the second die may have any of 6 outcomes, so the total is 
6+6+6+6+6-+6 = 36, or more compactly, 6-6 = 36. Note that we are really using the 
addition principle here: set A; is all pairs (1,2), set Ag is all pairs (2,2), and so on. This 
is somewhat more subtle than is first apparent. In this simple example, the outcomes of 
die number two have nothing to do with the outcomes of die number one. Here’s a slightly 
more complicated example: how many ways are there to roll two dice so that the two dice 
don’t match? That is, we rule out 1-1, 2-2, and so on. Here for each possible value on 
die number one, there are five possible values for die number two, but they are a different 
five values for each value on die number one. Still, because all are the same, the result is 
5+5+5+5+5+5 = 30, or 6-5 = 30. In general, then, if there are m possibilities for one 
event, and n for a second event, the number of possible outcomes for both events together 
is m-n. This is often called the multiplication principle. 

In general, if n events have m; possible outcomes, for 7 = 1,...,n, where each m;, is 
unaffected by the outcomes of other events, then the number of possible outcomes overall 
is [[;_, m:. This too can be proved by induction. 


EXAMPLE 1.2.1 How many outcomes are possible when three dice are rolled, if no two 
of them may be the same? The first two dice together have 6 - 5 = 30 possible outcomes, 
from above. For each of these 30 outcomes, there are four possible outcomes for the third 
die, so the total number of outcomes is 30-4 = 6-5-4 = 120. (Note that we consider the 
dice to be distinguishable, that is, a roll of 6, 4, 1 is different than 4, 6, 1, because the first 
and second dice are different in the two rolls, even though the numbers as a set are the 


same.) 


EXAMPLE 1.2.2 Suppose blocks numbered 1 through n are in a barrel; we pull out k 
of them, placing them in a line as we do. How many outcomes are possible? That is, how 
many different arrangements of k blocks might we see? 

This is essentially the same as the previous example: there are k “spots” to be filled by 
blocks. Any of the n blocks might appear first in the line; then any of the remaining n — 1 
might appear next, and so on. The number of outcomes is thus n(n—1)(n—2)---(n—k+1), 
by the multiplication principle. In the previous example, the first “spot” was die number 
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one, the second spot was die number two, the third spot die number three, and 6-5-4 = 
6(6 — 1)(6 — 2); notice that 6-2 =6-—3+41. 


This is quite a general sort of problem: 


DEFINITION 1.2.3 The number of permutations of n things taken k at a time is 


Ost) ee 2 Sea) ae 


A permutation of some objects is a particular linear ordering of the objects; P(n, k) 
in effect counts two things simultaneously: the number of ways to choose and order k out 
of n objects. A useful special case is k = n, in which we are simply counting the number 
of ways to order all n objects. This is n(n —1)---(n-—n+1) =n!. Note that the second 
form of P(n,k) from the definition gives 


n! n! 


(n—n)! OF 


This is correct only if 0! = 1, so we adopt the standard convention that this is true, that 
is, we define 0! to be 1. 

Suppose we want to count only the number of ways to choose k items out of n, that is, 
we don’t care about order. In example 1.2.1, we counted the number of rolls of three dice 
with different numbers showing. The dice were distinguishable, or in a particular order: a 
first die, a second, and a third. Now we want to count simply how many combinations of 
numbers there are, with 6, 4, 1 now counting as the same combination as 4, 6, 1. 


EXAMPLE 1.2.4 Suppose we were to list all 120 possibilities in example 1.2.1. The list 
would contain many outcomes that we now wish to count as a single outcome; 6, 4, 1 and 
4, 6, 1 would be on the list, but should not be counted separately. How many times will a 
single outcome appear on the list? This is a permutation problem: there are 3! orders in 
which 1, 4, 6 can appear, and all 6 of these will be on the list. In fact every outcome will 
appear on the list 6 times, since every outcome can appear in 3! orders. Hence, the list is 


too big by a factor of 6; the correct count for the new problem is 120/6 = 20. 


Following the same reasoning in general, if we have n objects, the number of ways to 
choose k of them is P(n,k)/k!, as each collection of k objects will be counted k! times by 
Pack) 
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DEFINITION 1.2.5 The number of subsets of size k of a set of size n (also called an 


n-set) is 
P(n,k n! n 
C= .  - k(n —k)! ( ). 


The notation C(n,k) is rarely used; instead we use ae pronounced “n choose k”. 


EXAMPLE 1.2.6 Consider n = 0,1, 2,3. It is easy to list the subsets of a small n-set; a 
typical n-set is {a1,a2,...,@n}. A 0-set, namely the empty set, has one subset, the empty 
set; a 1-set has two subsets, the empty set and {a,}; a 2-subset has four subsets, 0, {a1}, 
{a2}, {a1, a2}; and a 3-subset has eight: @, {a1}, {a2}, {a3}, {a1, a2}, {a1, a3}, {a2, a3}, 
{a1,@2,a3}. From these lists it is then easy to compute (ys 


k 
k 
Ooh 28 
0}; 1 
FL yee | 
Det eh ee Sal 
3 | 1 3 1 


You probably recognize these numbers: this is the beginning of Pascal’s Triangle. 
Each entry in Pascal’s triangle is generated by adding two entries from the previous row: 
the one directly above, and the one above and to the left. This suggests that (2) = 
Cra) + (ee and indeed this is true. To make this work out neatly, we adopt the 


convention that (i) =Owhenk<Oork>n. 


n n—1 n—1 
THE EM 1.2. = . 
oo i (3) Cal k ) 


Proof. A typical n-set is A = {a1,...,a,}. We consider two types of subsets: those 
that contain a, and those that do not. If a k-subset of A does not contain ay, then it is 


a k-subset of {a1,...,@,—1}, and there are eC) of these. If it does contain a,,, then it 
consists of a, and k — 1 elements of {a1,...,@n—1}; since there are (ey of these, there 
are ae subsets of this type. Thus the total number of k-subsets of A is Ce) + Cae 

Note that when k = 0, Cy) = (Hay = 0, and when k = n, (aig = anaes = 0, 
so that (5) = Ca and (") = (esis These values are the boundary ones in Pascal’s 
Triangle. = 


Many counting problems rely on the sort of reasoning we have seen. Here are a few 
variations on the theme. 
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EXAMPLE 1.2.8 Six people are to sit at a round table; how many seating arrangements 
are there? 

It is not clear exactly what we mean to count here. If there is a “special seat”, for 
example, it may matter who ends up in that seat. If this doesn’t matter, we only care 
about the relative position of each person. Then it may or may not matter whether a 
certain person is on the left or right of another. So this question can be interpreted in (at 
least) three ways. Let’s answer them all. 

First, if the actual chairs occupied by people matter, then this is exactly the same 
as lining six people up in a row: 6 choices for seat number one, 5 for seat two, and so 
on, for a total of 6!. If the chairs don’t matter, then 6! counts the same arrangement too 
many times, once for each person who might be in seat one. So the total in this case is 
6!/6 = 5!. Another approach to this: since the actual seats don’t matter, just put one 
of the six people in a chair. Then we need to arrange the remaining 5 people in a row, 
which can be done in 5! ways. Finally, suppose all we care about is who is next to whom, 
ignoring right and left. Then the previous answer counts each arrangement twice, once for 


the counterclockwise order and once for clockwise. So the total is 5!/2 = P(5,3). 


We have twice seen a general principle at work: if we can overcount the desired set in 
such a way that every item gets counted the same number of times, we can get the desired 
count just by dividing by the common overcount factor. This will continue to be a useful 
idea. A variation on this theme is to overcount and then subtract the amount of overcount. 


EXAMPLE 1.2.9 How many ways are there to line up six people so that a particular 
pair of people are not adjacent? 

Denote the people A and B. The total number of orders is 6!, but this counts those 
orders with A and B next to each other. How many of these are there? Think of these 
two people as a unit; how many ways are there to line up the AB unit with the other 4 
people? We have 5 items, so the answer is 5!. Each of these orders corresponds to two 
different orders in which A and B are adjacent, depending on whether A or B is first. So 
the 6! count is too high by 2-5! and the count we seek is 6! — 2-5! =4-5!. 


Exercises 1.2. 


1. How many positive factors does 2-3*-7°-11?-47° have? How many does p{! pS? - -- p&” have, 


where the p,; are distinct primes? 


2. A poker hand consists of five cards from a standard 52 card deck with four suits and thirteen 
values in each suit; the order of the cards in a hand is irrelevant. How many hands consist 
of 2 cards with one value and 3 cards of another value (a full house)? How many consist of 
5 cards from the same suit (a flush)? 
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3. Six men and six women are to be seated around a table, with men and women alternating. 
The chairs don’t matter, only who is next to whom, but right and left are different. How 
many seating arrangements are possible? 


4. Eight people are to be seated around a table; the chairs don’t matter, only who is next to 
whom, but right and left are different. Two people, X and Y, cannot be seated next to each 
other. How many seating arrangements are possible? 


5. In chess, a rook attacks any piece in the same row or column as the rook, provided no other 
piece is between them. In how many ways can eight indistinguishable rooks be placed on a 
chess board so that no two attack each other? What about eight indistinguishable rooks on 
a 10 x 10 board? 


6. Suppose that we want to place 8 non-attacking rooks on a chessboard. In how many ways 
can we do this if the 16 most ‘northwest’ squares must be empty? How about if only the 4 
most ‘northwest’ squares must be empty? 


7. A “legal” sequence of parentheses is one in which the parentheses can be properly matched, 
like ()(()). It’s not hard to see that this is possible precisely when the number of left and right 
parentheses is the same, and every initial segment of the sequence has at least as many left 
parentheses as right. For example, ())... cannot possibly be extended to a legal sequence. 
Show that the number of legal sequences of length 2n is Cp, = (4) - (ae The numbers 
C, are called the Catalan numbers. 


1.3. BINOMIAL COEFFICIENTS 


Recall the appearance of Pascal’s Triangle in example 1.2.6. If you have encountered the 
triangle before, you may know it has many interesting properties. We will explore some of 
these here. 

You may know, for example, that the entries in Pascal’s Triangle are the coefficients 
of the polynomial produced by raising a binomial to an integer power. For example, 
(2 +y)? =1-2°4+3-22y+3-ary?+1-y%, and the coefficients 1, 3, 3, 1 form row three of 


Pascal’s Triangle. For this reason the numbers ( ") are usually referred to as the binomial 
coefficients. 


THEOREM 1.3.1 Binomial Theorem 


n n n n n-1 n n—-2,2 n n . n n—-1t, 4 
(x+y) = (5) +(f)z v+(S)e y bt (Oy -d(j)= y 


Proof. We prove this by induction on n. It is easy to check the first few, say for 
n = 0,1,2, which form the base case. Now suppose the theorem is true for n — 1, that is, 


n—-1 aa Te yg 
(x+y) = , )* y 


Then 
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Using the distributive property, this becomes 
n—-1 n—-1 
n—I1 -1-i, i m—-1)\ n-1-i, 4 
ey (* pete (My ets 
i=0 i=0 


= 
n— aay n—1—i, i+1 
ECP eS (pve 


— 


These two sums have much in common, but it is slightly disguised by an “offset”: the first 


sum starts with an 2”y° term and ends with an x'y"—! term, while the corresponding 


terms in the second sum are «”~'y! and x°y”. Let’s rewrite the second sum so that they 


match: 


E(w EC 


Now we can use theorem 1.2.7 to get: 


‘i "| a+ s(" : ') 4 Coo + (" 7 i) y”. 


At the next to last step we used the facts that ("a7) = (5) and ae? = ae = 


Here is an interesting consequence of this theorem: Consider 
(@t+y)"=(@+y)@t+y)---@+y). 


One way we might think of attempting to multiply this out is this: Go through the n 
factors (x + y) and in each factor choose either the x or the y; at the end, multiply your 
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choices together, getting some term like rryryy--- yx = xy, where of course i +7 = n. 
If we do this in all possible ways and then collect like terms, we will clearly get 


nm 
y any. 
i=0 


We know that the correct expansion has (7) = aj;; is that in fact what we will get by this 
method? Yes: consider x”~*y’. How many times will we get this term using the given 
method? It will be the number of times we end up with 7 y-factors. Since there are n 
factors (x+y), the number of times we get i y-factors must be the number of ways to pick 
i of the (x+y) factors to contribute a y, namely a. This is probably not a useful method 


in practice, but it is interesting and occasionally useful. 


EXAMPLE 1.3.2 Using this method we might get 


he + y)? = ULL + LLY + LYL + LYyY + YLL + yxy + yyxr + yyy 


which indeed becomes x? + 3x7y + 3xy? + y® upon collecting like terms. 


The Binomial Theorem, 1.3.1, can be used to derive many interesting identities. A 
common way to rewrite it is to substitute y = 1 to get 


n 
gr = 3) 
XC 
that is, row n of Pascal’s Triangle sums to 2”. This is also easy to understand combinato- 
rially: the sum represents the total number of subsets of an n-set, since it adds together 
the numbers of subsets of every possible size. It is easy to see directly that the number of 
subsets of an n-set is 2”: for each element of the set we make a choice, to include or to 
exclude the element. The total number of ways to make these choices is 2-2---2 = 2”, by 
the multiplication principle. 
Suppose now that n > 1 and we substitute —1 for x; we get 


(-1+1)"= 3 (") (a1j)"*, (1.3.1) 


i=0 
The sum is now an alternating sum: every other term is multiplied by —1. Since the left 
hand side is 0, we can rewrite this to get 


()+ Gr +G)r~ = 


So each of these sums is 2”~!. 
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Another obvious feature of Pascal’s Triangle is symmetry: each row reads the same 
forwards and backwards. That is, we have: 


THEOREM 1.3.3 (T= ( i ) 


i n—1t 
Proof. This is quite easy to see combinatorially: every i-subset of an n-set is associated 
with an (n — i)-subset. That is, if the n-set is A, and if B C A has size i, then the 
complement of B has size n —7. This establishes a 1-1 correspondence between sets of size 
i and sets of size n — i, so the numbers of each are the same. (Of course, if i = n — i, no 
proof is required.) 7 


Note that this means that the Binomial Theorem, 1.3.1, can also be written as 


or 


Another striking feature of Pascal’s Triangle is that the entries across a row are strictly 
increasing to the middle of the row, and then strictly decreasing. Since we already know 
that the rows are symmetric, the first part of this implies the second. 


THEOREM 1.3.4 pris gl. (“)>(,")). 
) a 


Proof. ‘This is by induction; the base case is apparent from the first few rows. Write 
(= Ga) CT) 
-J= |. + 
a t=1 a 
n feat #: n=l 
i-1/ \i-2 i-l 


Provided that 1 <i < [24], we know by the induction hypothesis that 


Provided that 1<7i—1< [254], or equivalently 2<i< [254] +1, we know that 


Hence if 2<i< [254], 


This leaves two special cases to check: i = 1 and for n even, i = |"5+| +1= |%]. These 


are left as an exercise. P| 
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Exercises 1.3. 


1. 


Suppose a street grid starts at position (0,0) and extends up and to the right: 


(0, 0) 
A shortest route along streets from (0,0) to (i,7) is i+ 7 blocks long, going 7 blocks east and 
j blocks north. How many such routes are there? Suppose that the block between (k,/) and 
(k+ 1,1) is closed, where k <i and 1 < j. How many shortest routes are there from (0,0) to 


(i, 5)? 


. Prove by induction that S77", (4) — eee for n > 0 andi > 0. 


i+1 


. Use a combinatorial argument to prove that )77_, (*) = aes for n > 0 and i > 0; that is, 


i i+] 
explain why the left-hand side counts the same thing as the right-hand side. 


4. Use a combinatorial argument to prove that C) + Ce + k(n—k) = (3). 


5. Use a combinatorial argument to prove that ( 


8. Verify that ("7*) + (3) =n?. Use exercise 2 to find a simple expression for ))"_, i. 


10. 
11. 


12. 


13. 


2n 


=) is even. 


. Suppose that A is a non-empty finite set. Prove that A has as many even-sized subsets as it 


does odd-sized subsets. 


. Prove that S7/_4 (‘)k = ln — CT) for n > 0 andi > 0. 


i+1 i+2 
2 
2 


. Make a conjecture about the sums of the upward diagonals in Pascal’s Triangle as indicated. 


Prove your conjecture is true. 
1 
4 6 4. 1 


Find the number of ways to write n as an ordered sum of ones and twos, n > 0. For example, 
when n = 4, there are five ways: 1+1+1+1,2+1+4+1,14+2+1,1+1+42, and2+2. 


Use (a +1)” = 0”, (")2" to find a simple expression for )77_, i(")a*~*. Then find a simple 
expression for 07", i("). 
Use (« +1)" = Soi, (")a" to find a simple expression for }>7._5 a (")a*t*. Then find a 


. : n ar n 
simple expression for )7j9 74 C) 


Use the previous exercise to find a simple expression for yy (-1)' a (): 


14. 


15. 
16. 
17. 


18. 
19. 
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Give a combinatorial proof of 


x (")() -("s"): 


Rewrite this identity in simpler form if m =n, and when k = m =n. 
Finish the proof of theorem 1.3.4. 
Give an alternate proof of theorm 1.3.4 by characterizing those i for which ear 7 ) ell 


i-1 
When is (")/(,",) a maximum? When is (")/(,”,) = 2? 


i-1 a 

When is (") — (,",) a maximum? 

A Galton board is an upright flat surface with protruding horizontal pins arranged as shown 
below. At the bottom are a number of bins; if the number of rows is n, the number of bins 
isn+1. A ball is dropped directly above the top pin, and at each pin bounces left or right 
with equal probability. We assume that the ball next hits the pin below and immediately left 
or right of the pin it has struck, and this continues down the board, until the ball falls into 
a bin at the bottom. If we number the bins from 0 to n, how many paths can a ball travel 
to end up in bin k? 


This may be interpreted in terms of probability, which was the intent of Sir Francis 
Galton when he designed it. Each path is equally likely to be taken by a ball. If many balls 
are dropped, the number of balls in bin & corresponds to the probability of ending up in 
that bin. The more paths that end in a bin, the higher the probability. When a very large 
number of balls are dropped, the balls will form an approximation to the bell curve familiar 
from probability and statistics. 


There is an animation of the process at 
https: //www.randomservices.org/random/apps/GaltonBoardExperiment .html. 


There was once a very nice physical implementation at the Pacific Science Center in 
Seattle. 


1.4 BELL NUMBERS 


We begin with a definition: 
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DEFINITION 1.4.1 A partition of a set S is a collection of non-empty subsets A; C S, 
1 <i<k (the parts of the partition), such that UL, A; = S and for every i # j, 
A,N Aj = @. 


EXAMPLE 1.4.2 The partitions of the set {a,b,c} are {{a}, {b}, {c}}, {{a, b}, {ch}, 
{{a, ch, {oy t, (16, ch, tary, and {{a, b, ch}. 


Partitions arise in a number of areas of mathematics. For example, if = is an equiv- 


alence relation on a set S', the equivalence classes of = form a partition of S. Here we 
consider the number of partitions of a finite set S, which we might as well take to be 
[In] = {1,2,3,...,n}, unless some other set is of interest. We denote the number of parti- 
tions of an n-element set by B,,; these are the Bell numbers. From the example above, we 
see that B3 = 5. For convenience we let Bo = 1. It is quite easy to see that B, = 1 and 
Bz =2. 

There are no known simple formulas for B,,, so we content ourselves with a recurrence 


relation. 


THEOREM 1.4.3 The Bell numbers satisfy 


Buti = >~ 4 By. 


k=0 


Proof. Consider a partition of S = {1,2,...,n +1}, Ai,...,Am. We may suppose that 
n+1 isin Aj, and that |A;| = k+1, for some k,0 <k <n. Then Ag,...,Am form a 
partition of the remaining n—k elements of S, that is, of S'\\A,. There are B,_, partitions 
of this set, so there are B,_, partitions of S in which one part is the set A,. There are 
(2) sets of size k+1 containing n+1, so the total number of partitions of S in which n+1 
is in a set of size k + 1 is ay B,-,x. Adding up over all possible values of k, this means 


ee s & Bick, (1.4.1) 


k=0 


We may rewrite this, using theorem 1.3.3, as 


written backwards. | 
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EXAMPLE 1.4.4 We apply the recurrence to compute the first few Bell numbers: 


“EG a 


1 
er eee 


Beate k-p2s 1s 25 


w 


By =1-14+3-14+3-24+1-5=15 


) 
(i) 
) 
= () 


w iP 
jo) 
Poo SS fe NR OT ON 
sx b 


The Bell numbers grow exponentially fast; the first few are 1, 1, 2, 5, 15, 52, 203, 877, 
4140, 21147, 115975, 678570, 4213597, 27644437. 

The Bell numbers turn up in many other problems; here is an interesting example. 
A common need in some computer programs is to generate a random permutation of 
1,2,3,...,n, which we may think of as a shuffle of the numbers, visualized as numbered 
cards in a deck. Here is an attractive method that is easy to program: Start with the 
numbers in order, then at each step, remove one number at random (this is easy in most 
programming languages) and put it at the front of the list of numbers. (Viewed as a shuffle 
of a deck of cards, this corresponds to removing a card and putting it on the top of the 
deck.) How many times should we do this? There is no magic number, but it certainly 
should not be small relative to the size of n. Let’s choose n as the number of steps. 

We can write such a shuffle as a list of n integers, (m1,™m2,...,™n). This indicates 
that at step 7, the number m, is moved to the front. 


EXAMPLE 1.4.5  Let’s follow the shuffle (3, 2, 2,4, 1): 


(3): 31245 
(2): 23145 
(2): 23145 
(4): 42315 
(1): 14235 


Note that we allow “do nothing” moves, removing the top card and then placing it on 
top. The number of possible shuffles is then easy to count: there are n choices for the card 
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to remove, and this is repeated n times, so the total number is n”. (If we continue a shuffle 
for m steps, the number of shuffles is n™.) Since there are only n! different permutations 
of 1,2,...,n, this means that many shuffles give the same final order. 

Here’s our question: how many shuffles result in the original order? 


EXAMPLE 1.4.6 These shuffles return to the original order: (1,1,1,1,1), (5,4,3, 2,1), 
(4, 1,3, 2,1). 


THEOREM 1.4.7 The number of shuffles of [n] that result in the original sorted order 
is By. 


Proof. Since we know that B,, counts the number of partitions of {1,2,3,...,n}, we can 
prove the theorem by establishing a 1-1 correspondence between the shuffles that leave 
the deck sorted and the partitions. Given a shuffle (m1, 7mz2,...,7™n), we put into a single 
set all 7 such that m,; has a single value. For example, using the shuffle (4, 1,3,2,1), Since 
mz = Ms, one set is {2,5}. All the other values are distinct, so the other sets in the 
partition are {1}, {3}, and {4}. 

Note that every shuffle, no matter what the final order, produces a partition by this 
method. We are only interested in the shuffles that leave the deck sorted. What we now 
need is to show that each partition results from exactly one such shuffle. 

Suppose we have a partition with k parts. If a shuffle leaves the deck sorted, the last 
entry, Mn, must be 1. If the part containing n is A,, then it must be that m; = 1 if and 
only if i € A,. If k = 1, then the only part contains all of {1,2,...,n}, and the shuffle 
tosh: ber Lads. os), 

If k > 1, the last move that is not 1 must be 2, since 2 must end up immediately after 
1. Thus, if jo is the largest index such that jo ¢ Aj, let Ag be the part containing j2, and 
it must be that m; = 2 if and only if 7 € Ag. We continue in this way: Once we have 
discovered which of the m; must have values 1,2,...,p, let jp41 be the largest index such 
that jp4i1 ¢ Ar U---U Ap, let Api be the part containing j,41, and then m; = p+ 1 if 
and only if 7 € Api. When p = k, all values m; have been determined, and this is the 
unique shuffle that corresponds to the partition. = 


EXAMPLE 1.4.8 Consider the partition {{1,5}, {2,3,6}, {4,7}}. We must have m7 = 
ma =1, Me = M3 = M2 = 2, and ms = m1 = 3, so the shuffle is (3, 2,2, 1,3, 2,1). 


Returning to the problem of writing a computer program to generate a partition: is 
this a good method? When we say we want a random permutation, we mean that we 
want each permutation to occur with equal probability, namely, 1/n!. Since the original 
order is one of the permutations, we want the number of shuffles that produce it to be 
exactly n”/n!, but n! does not divide n”, so this is impossible. The probability of getting 
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the original permutation is B,,/n", and this turns out to be quite a bit larger than 1/n!. 
Thus, this is not a suitable method for generating random permutations. 

The recurrence relation above is a somewhat cumbersome way to compute the Bell 
numbers. Another way to compute them is with a different recurrence, expressed in the 
Bell triangle, whose construction is similar to that of Pascal’s triangle: 


Ai 1 

Aa A22 1 2 

A3,1 A32 A3,3 2 3 5 

Agi Ago A43 Aga 5 7 10 15 


The rule for constructing this triangle is: A; = 1; the first entry in each row is the last 
entry in the previous row; other entries are Ay,~. = An,~—1 + An—1,k-1; row n has n entries. 
Both the first column and the diagonal consist of the Bell numbers, with A, = By,—1 and 
Ann = Pe 

An, may be interpreted as the number of partitions of {1,2,...,2 + 1} in which 
{k + 1} is the singleton set with the largest entry in the partition. For example, A3,2 = 3; 
the partitions of 3+1 = 4 in which 2+ 1 = 3 is the largest number appearing in a singleton 
set are {{1}, {2,4}, {3}}, ({2}, {1.4}, (3H, and {{1, 2,4}, 3H} 

To see that this indeed works as advertised, we need to confirm a few things. First, 
consider A,,,,, the number of partitions of {1,2,...,n+1} in which {n+1} is the singleton 
set with the largest entry in the partition. Since n + 1 is the largest element of the set, 
all partitions containing the singleton {n + 1} satisfy the requirement, and so the B,, 
partitions of {1,2,...,n} together with {n+ 1} are exactly the partitions of interest, that 
16 Ag = Dae 

Next, we verify that under the desired interpretation, it is indeed true that An, = 
Anp—1 + An—1,r-1 for k > 1. Consider a partition counted by A, ,~1. This contains 
the singleton {k}, and the element & + 1 is not in a singleton. If we interchange & and 
k +1, we get the singleton {/ + 1}, and no larger element is in a singleton. This gives 
us partitions in which {k + 1} is a singleton and {k} is not. Now consider a partition 
of {1,2,...,n} counted by An—14.-1. Replace all numbers j > k by j + 1, and add the 
singleton {k + 1}. This produces a partition in which both {k +1} and {k} appear. In 
fact, we have described how to produce each partition counted by Ap, exactly once, and 
sO An,k = Ag pt + Aint Rats 

Finally, we need to verify that A,1 = By_1; we establish a bijection between the 
two sets. Suppose a partition in A,,; has the form {{1},{2},X1, Xe,...,Xz}, where 
no X; is a singleton. Then the collection {X1, X2,...,X,} is a partition of the n — 1 
element set {3,4,...,n + 1}. Now we subtract 2 from every element to get a parti- 
tion of [n — 1] that has no singleton sets. Any other partition in A, has the form 
{{2}, Xi, Xo,...,X~}, where no X; is a singleton. One of the sets X; contains n+1, say X}. 


26 Chapter 1 Fundamentals 


Remove n+ 1 and split the remainder of X, into singleton sets {x,}, {r2},...,{@m}. Now 
{{xi}, {x2},..., {Um}, Xo, X3,..., Xx} is a partition of the n—1 element set {1,3,...,n}. 
Subtracting one from all the elements except 1 produces a partition of [n — 1] with at least 
one singleton set. If p € Ani, designate the partition of [n — 1] obtained in this way by 


f(p). 


Now we define a function g from B,_; to Ay. If partition p € B,—-1 contains no sin- 
gletons, say p = {X1,..., Xz}, add two to every element forming sets X;*, and let g(p) = 


{{1}, {2}, X7,..., XZ}. Ifp contains singletons, say p = {{@1}, {%o},...,{@m}, Xo, X3,..., Xz}, 
then add 1 to all elements except 1, forming a partition {{xj}, {w5},..., {vt}, X3,X3,..., XZ} 
of the set {1,3,...,n}. Now let g(p) = {{2}, {x7 23,...,2%,,n +1}, X3,..., XZ}. 

Since f(g(p)) = p and g(f(p)) = p, f and g are inverses and so f is a bijection between 
A,,, and B,,_1, as desired. 


Exercises 1.4. 


1. Show that if {Ai, A2,..., Ax} is a partition of {1,2,...,n}, then there is a unique equivalence 
relation = whose equivalence classes are { Ai, A2,..., Ax}. 


2. Suppose n is a square-free number, that is, no number m? divides n; put another way, square- 
free numbers are products of distinct prime factors, that is, n = pipa---pe, where each p; 
is prime and no two prime factors are equal. Find the number of factorizations of n. For 
example, 30 = 2-3-5, and the factorizations of 30 are 30, 6-5, 10-3, 2-15, and 2-3-5. Note 
we count 30 alone as a factorization, though in some sense a trivial factorization. 


3. The rhyme scheme of a stanza of poetry indicates which lines rhyme. This is usually expressed 
in the form ABAB, meaning the first and third lines of a four line stanza rhyme, as do the 
second and fourth, or ABCB, meaning only lines two and four rhyme, and so on. A limerick 
is a five line poem with rhyming scheme AABBA. How many different rhyme schemes are 
possible for an n line stanza? To avoid duplicate patterns, we only allow a new letter into 
the pattern when all previous letters have been used to the left of the new one. For example, 
ACBA is not allowed, since when C is placed in position 2, B has not been used to the left. 
This is the same rhyme scheme as ABCA, which is allowed. 


4. Another way to express the Bell numbers for n > 0 is 
B= Se S(n,k), 
k=1 


where S(n,k) is the number of partitions of {1,2,...,n} into exactly k parts, 1 <k <n. 
The S(n,k) are the Stirling numbers of the second kind. Find a recurrence relation 
for S(n,k). Your recurrence should allow a fairly simple triangle construction containing the 
values S(n,k), and then the Bell numbers may be computed by summing the rows of this 
triangle. Show the first five rows of the triangle, n € {1,2,...,5}. 


5. Let A, be the number of partitions of {1,2,...,n + 1} in which no consecutive integers 
are in the same part of the partition. For example, when n = 3 these partitions are 


{{1}, {2}, {3}, (4, {11}, 12,44, (3h, {11 3), (23, (4, (0 3h 12,453, (1 4b {25 £33, 


so A3 = 5. Let A(n,k) be the number of partitions of {1,2,...,n +1} into exactly k parts, 
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in which no consecutive integers are in the same part of the partition. Thus 
nt+l1 


An = >_ A(n,k). 


Find a recurrence for A(n,k) and then show that An = Bn. 


1.5 CHOICE WITH REPETITION 


Most of the permutation and combination problems we have seen count choices made with- 
out repetition, as when we asked how many rolls of three dice are there in which each die 
has a different value. The exception was the simplest problem, asking for the total number 
of outcomes when two or three dice are rolled, a simple application of the multiplication 
principle. Typical permutation and combination problems can be interpreted in terms of 
drawing balls from a box, and implicitly or explicitly the rule is that a ball drawn from 
the box stays out of the box. If instead each ball is returned to the box after recording the 
draw, we get a problem essentially identical to the general dice problem. For example, if 
there are six balls, numbered 1-6, and we draw three balls with replacement, the number 
of possible outcomes is 6°. Another version of the problem does not replace the ball after 
each draw, but allows multiple “identical” balls to be in the box. For example, if a box 
contains 18 balls numbered 1-6, three with each number, then the possible outcomes when 
three balls are drawn and not returned to the box is again 6°. If four balls are drawn, 
however, the problem becomes different. 

Another, perhaps more mathematical, way to phrase such problems is to introduce 
the idea of a multiset. A multiset is like a set, except that elements may appear more 
than once. If {a,b} and {b,c} are ordinary sets, we say that the union {a,b} U {b,c} is 
{a,b,c}, not {a,b, b,c}. If we interpret these as multisets, however, we do write {a,b, b,c} 
and consider this to be different than {a,b,c}. To distinguish multisets from sets, and to 
shorten the expression in most cases, we use a repetition number with each element. 
For example, we will write {a,b,b,c} as {1-a,2-b,1-c}. By writing {1-a,1-b,1-c} we 
emphasize that this is a multiset, even though no element appears more than once. We 
also allow elements to be included an infinite number of times, indicated with oo for the 
repetition number, like {oo - a,5-b,3-c}. 

Generally speaking, problems in which repetition numbers are infinite are easier than 


those involving finite repetition numbers. Given a multiset A = {00-a1,00-ag,...,00-dn}, 
how many permutations of the elements of length k are there? That is, how many sequences 
£1, %2,...,£~% can be formed? This is easy: the answer is n”. 


Now consider combinations of a multiset, that is, submultisets: Given a multiset, how 
many submultisets of a given size does it have? We say that a multiset A is a submultiset 
of B if the repetition number of every element of A is less than or equal to its repetition 
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number in B. For example, {20-a,5-6,1-c} is a submultiset of {oo -a,5-b,3-c}. A 
multiset is finite if it contains only a finite number of distinct elements, and the repetition 
numbers are all finite. Suppose again that A = {00-a1,00- dg,...,00- dn}; how many 
finite submultisets does it have of size k? This at first seems quite difficult, but put in the 
proper form it turns out to be a familiar problem. Imagine that we have k+n—1 “blank 
spaces”, like this: 


Now we place n — 1 markers in some of these spots: 


A A A ae A 


This uniquely identifies a submultiset: fill all blanks up to the first A with a1, up to the 
second with a2, and so on: 


\ a2 A a3 a3 a3 A a4... An-1 Gn-1 NA Qn 


So this pattern corresponds to the multiset {1-a2,3-a3,...,1-a,}. Filling in the markers 
/ in all possible ways produces all possible submultisets of size k, so there are (a) 
such submultisets. Note that this is the same as ae ae the hard part in practice is 


remembering that the —1 goes with the n, not the k. 
e e e 


Summarizing the high points so far: The number of permutations of n things taken k 
at a time without replacement is P(n,k) = n!/(n — k)!; the number of permutations of n 
things taken k at a time with replacement is n*. The number of combinations of n things 
taken k at a time without replacement is tae the number of combinations of n things 


taken k at a time with replacement is (Ca 
e e e 
If A = {m1 -a1,M2-a2,...,™Mn + Gn}, similar questions can be quite hard. Here is an 


easier special case: How many permutations of the multiset A are there? That is, how many 
sequences consist of m , copies of a,, m1 copies of ag, and so on? This problem succumbs 
to overcounting: suppose to begin with that we can distinguish among the different copies 
of each a;; they might be colored differently for example: a red a,, a blue a,, and so 
on. Then we have an ordinary set with M = 5>;"_, m; elements and M! permutations. 
Now if we ignore the colors, so that all copies of a; look the same, we find that we have 
overcounted the desired permutations. Permutations with, say, the a; items in the same 
positions all look the same once we ignore the colors of the a;s. How many of the original 
permutations have this property? m ,! permutations will appear identical once we ignore 
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the colors of the a; items, since there are m,! permutations of the colored a;s in a given 
m , positions. So after throwing out duplicates, the number of remaining permutations is 
M!/m,! (assuming the other a, are still distinguishable). Then the same argument applies 
to the ags: oe are m2! copies of each permutation once we ignore the colors of the azs, so 


M! 
there are I distinct permutations. Continuing in this way, we see that the number 


my!my! 


of distinct permutations once all colors are ignored is 
M! 
mM !Mg!---My! 


This is frequently written 


M 
Mm, Mg ... Mn , 


called a multinomial coefficient. Here the second row has n separate entries, not a 
single product entry. Note that if n = 2 this is 


M M! M! M 
= => — . (1.6.1) 
m1 m2 my!m2!  m,!(M—m,)! m1 


This is easy to see combinatorially: given {m1 - a1,m2- a2} we can form a permutation 


by choosing the m, places that will be occupied by ay, filling in the remaining m2 places 
with a2. The number of permutations is the number of ways to choose the m, locations, 
which is eae 


EXAMPLE 1.5.1 How many solutions does 7; +22+%3+24 = 20 have in non-negative 
integers? That is, how many 4-tuples (m1,,m2,™m3,m4) of non-negative integers are solu- 
tions to the equation? We have actually solved this problem: How many submultisets of 
size 20 are there of the multiset {00 - a1, 00 - dz,00 - a3,00- a4}? A submultiset of size 
20 is of the form {mj - a1, m2 - d2,m3 - a3,m4- a4} where }>m; = 20, and these are in 
1-1 correspondence with the set of 4-tuples (m1, m2,™m3, m4) of non-negative integers such 
that 5>m,; = 20. Thus, the number of solutions is eee This reasoning applies in 


20 
general: the number of solutions to 


This immediately suggests some generalizations: instead of the total number of solu- 
tions, we might want the number of solutions with the variables x; in certain ranges, that 
is, we might require that m; < x; < M; for some lower and upper bounds m,; and M;. 
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Finite upper bounds can be difficult to deal with; if we require that 0 < x7; < M,, this 
is the same as counting the submultisets of {M1 - a1, M2-a2,...,Mn-an}. Lower bounds 


are easier to deal with. 


EXAMPLE 1.5.2 Find the number of solutions to 7; +272+%3+24 = 20 with x, > 0, 
t2>1,%3 > 2, 474 > —-1. 

We can transform this to the initial problem in which all lower bounds are 0. The 
solutions we seek to count are the solutions of this altered equation: 


£1 + (a2 — 1) + (a3 — 2) + (a4 +1) = 18. 


If we set yi = 41, yo = T2—1, y3 = 23 — 2, and ya = 44 +1, then (x1, 2%2,2%3,24) is a 
solution to this equation if and only if (y1, yo, y3, ya) is a solution to 


yi + yo + 93 + ys = 18, 


and moreover the bounds on the x; are satisfied if and only if y; > 0. Since the number 
18+4-1 


te F this is also the number of solutions to the 


of solutions to the last equation is ( 


original equation. 


Exercises 1.5. 


1. Suppose a box contains 18 balls numbered 1-6, three balls with each number. When 4 balls 
are drawn without replacement, how many outcomes are possible? Do this in two ways: 
assuming that the order in which the balls are drawn matters, and then assuming that order 
does not matter. 


2. How many permutations are there of the letters in Mississippi? 
3. How many permutations are there of the multiset {1-ai,1-a2,...,1-an}? 
4. Let M =>", mj. If ki < 0 for some i, let’s say 


M 
(., ko... .) = 


M “ M-1 
on ak ws) = ae cet CMe DY. er a 


Note that when n = 2 this becomes 


M\_f M-1 \ f( M-1 
m1 m2 7 (m1 — 1) m2 M1 (m2 — 1) ; 


As noted above in equation 1.5.1, when n = 2 we are really seeing ordinary binomial coeffi- 
cients, and this can be rewritten as 


(2) 


which of course we already know. 


Prove that 
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5. The Binomial Theorem (1.3.1) can be written 


n n a9 
(e+y)" = x ( ). y’, 
i+j=n #J 


where the sum is over all non-negative integers 7 and j that sum to n. Prove that for m > 2, 


(t1 +%2+++++2m)" = ) a. 2 ee ae eae 
41 42 «1. Um 
where the sum is over all 71,...,¢m such that 71 +---+im =n. 


6. Find the number of integer solutions to 


+%2+%34+%44+%5 = 50,01 > —3,%2 > 0,23 > 4,244 > 2,05 > 12. 


7. You and your spouse each take two gummy vitamins every day. You share a single bottle of 
60 vitamins, 30 of one flavor and 30 of another. You each prefer a different flavor, but it seems 
childish to fish out two of each type (but not to take gummy vitamins). So you just take the 
first four that fall out and then divide them up according to your preferences. For example, 
if there are two of each flavor, you and your spouse get the vitamins you prefer, but if three 
of your preferred flavor come out, you get two of the ones you like and your spouse gets one 
of each. Of course, you start a new bottle every 15 days. On average, over a 15 day period, 
how many of the vitamins you take are the flavor you prefer? (From fivethirtyeight.com.) 


1.6 THE PIGEONHOLE PRINCIPLE 


A key step in many proofs consists of showing that two possibly different values are in fact 
the same. The Pigeonhole principle can sometimes help with this. 


THEOREM 1.6.1 Pigeonhole Principle Suppose that n+ 1 (or more) objects are 
put into n boxes. Then some box contains at least two objects. 


Proof. Suppose each box contains at most one object. Then the total number of objects 
is at most 1+1+.---+1=n, a contradiction. a 


This seemingly simple fact can be used in surprising ways. The key typically is to put 
objects into boxes according to some rule, so that when two objects end up in the same 
box it is because they have some desired relationship. 


EXAMPLE 1.6.2. Among any 13 people, at least two share a birth month. 
Label 12 boxes with the names of the months. Put each person in the box labeled 
with his or her birth month. Some box will contain at least two people, who share a birth 


month. 


EXAMPLE 1.6.3 Suppose 5 pairs of socks are in a drawer. Picking 6 socks guarantees 
that at least one pair is chosen. 
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Label the boxes by “the pairs” (e.g., the red pair, the blue pair, the argyle pair,...). 


Put the 6 socks into the boxes according to description. 


Some uses of the principle are not nearly so straightforward. 


EXAMPLE 1.6.4 Suppose aj,...,@, are integers. Then some “consecutive sum” a, + 
Ak41 + Ge42 +--+: + Ak4m is divisible by n. 
Consider these n sums: 
§1 = a1 
$2 =a, + a2 


$3 =a, +a2+ a3 


Sn =A, + ag +°+-+4n 


These are all consecutive sums, so if one of them is divisible by n we are done. If not, 
dividing each by n leaves a non-zero remainder, r; = s; mod n, rg = sg mod n, and so on. 
These remainders have values in {1,2,3,...,n—1}. Label n — 1 boxes with these n — 1 
values; put each of the n sums into the box labeled with its remainder mod n. Two sums 
end up in the same box, meaning that s; mod n = s; mod n for some j > 2; hence s; — 8; 


is divisible by n, and s; — s; = aj41 + Qit2 +---+a,;, as desired. 


A similar argument provides a proof of the Chinese Remainder Theorem. 


THEOREM 1.6.5 Chinese Remainder Theorem _ If mand nare relatively prime, 
and 0 <a< mand 0 < b < n, then there is an integer x such that « mod m = a and 


xmodn = b. 


Proof. Consider the integers a,a +m,a+2m,...a+(n—1)m, each with remainder a 
when divided by m. We wish to show that one of these integers has remainder b when 
divided by n, in which case that number satisfies the desired property. 

For a contradiction, suppose not. Let the remainders be r9 = a mod n, r; = a+m mod 
N,---,; Tn-1 =a+(n—1)mmod n. Label n — 1 boxes with the numbers 0,1, 2,3,...,b— 
1,b+1,...n—1. Put each r; into the box labeled with its value. Two remainders end up 
in the same box, say r; and r;, with j > 7%, sor; =r; =r. This means that 


atim=qnt+r and a+jm=qon+r. 
Hence 
a+jm—(atim) =qn+r-—(qn+r) 
(j —t)m = (q2 — qi)n. 
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Since n is relatively prime to m, this means that n | (j —7). But since i and j are distinct 
and in {0,1,2,...,»-—1},0< j-—i<n,sonf(j —1). This contradiction finishes the 
proof. = 


More general versions of the Pigeonhole Principle can be proved by essentially the 
same method. A natural generalization would be something like this: If X objects are put 
into n boxes, some box contains at least m objects. For example: 


THEOREM 1.6.6 Suppose that r1,...,7 are positive integers. If X > (S7y, ri)—n4+1 
objects are put into n boxes labeled 1, 2,3,...,n, then some box labeled 7 contains at least 
r; objects. 


Proof. Suppose not. Then the total number of objects in the boxes is at most (r, —1)+ 
(ro -—1) + (r3 —1) +-++ + (rn — 1) = OF i) — 2 < X, a contradiction. : 


This full generalization is only occasionally needed; often this simpler version is suffi- 


cient: 


COROLLARY 1.6.7 Suppose r > 0 and X > n(r — 1) + 1 objects are placed into n 
boxes. Then some box contains at least r objects. 


Proof. Apply the previous theorem with r; =r for all 7. ai 


Here is a simple application of the Pigeonhole Principle that leads to many interesting 
questions. 


EXAMPLE 1.6.8 Suppose 6 people are gathered together; then either 3 of them are 
mutually acquainted, or 3 of them are mutually unacquainted. 

We turn this into a graph theory question: Consider the graph consisting of 6 vertices, 
each connected to all the others by an edge, called the complete graph on 6 vertices, and 
denoted K¢; the vertices represent the people. Color an edge red if the people represented 
by its endpoints are acquainted, and blue if they are not acquainted. Any choice of 3 
vertices defines a triangle; we wish to show that either there is a red triangle or a blue 
triangle. 

Consider the five edges incident at a single vertex v; by the Pigeonhole Principle (the 
version in corollary 1.6.7, with r = 3, X = 2(3 —1)+1=5), at least three of them are 
the same color, call it color C; call the other color D. Let the vertices at the other ends 
of these three edges be v1, v2, v3. If any of the edges between these vertices have color C, 
there is a triangle of color C: if the edge connects v; to vj, the triangle is formed by v, 


34 Chapter 1 Fundamentals 


v;, and v;. If this is not the case, then the three vertices v;, v2, v3 are joined by edges of 


color D, and form a triangle of color D. 


The number 6 in this example is special: with 5 or fewer vertices it is not true that 
there must be a monochromatic triangle, and with more than 6 vertices it is true. To see 
that it is not true for 5 vertices, we need only show an example, as in figure 1.6.1. 


Figure 1.6.1 An edge coloring with no monochromatic triangles. 


The Ramsey number R(7) is the smallest integer n such that when the edges of kK, 
are colored with two colors, there is a monochromatic complete graph on i vertices, Kj, 
contained within K,,. The example shows that R(3) = 6. 

More generally, R(i,7) is the smallest integer n such that when the edges of K,, are 
colored with two colors, say Cy; and C, either there is a K; contained within K,, all of 
whose edges are color C;, or there is a Kj; contained within K,, all of whose edges are 
color C2. Using this notion, R(k) = R(k,k). More generally still, R(i1,i2,...,im) is the 
smallest integer n such that when the edges of K,, are colored with m colors, Cy,...,Cm, 
then for some 7 there is a K;, contained in K,, all of whose edges are color Cj. 

Ramsey proved that in all of these cases, there actually is such a number n. General- 
izations of this problem have led to the subject called Ramsey Theory. 

Computing any particular value R(i, 7) turns out to be quite difficult; Ramsey numbers 
are known only for a few small values of 7 and 7, and in some other cases the Ramsey number 
is bounded by known numbers. Typically in these cases someone has exhibited a K,, and 
a coloring of the edges without the existence of a monochromatic K; or K; of the desired 
color, showing that R(i,7) > m; and someone has shown that whenever the edges of K,, 
have been colored, there is a K; or K; of the correct color, showing that R(i,j) <n. 


Exercises 1.6. 


1. Assume that the relation “friend” is symmetric. Show that if n > 2, then in any group of n 
people there are two with the same number of friends in the group. 


9. 
10. 


1.7 Sperner’s Theorem 35 


. Suppose that 501 distinct integers are selected from 1...1000. Show that there are distinct 


selected integers a and b such that a | b. Show that this is not always true if 500 integers are 
selected. => 


. Each of 15 red balls and 15 green balls is marked with an integer between 1 and 100 inclusive; 


no integer appears on more than one ball. The value of a pair of balls is the sum of the 
numbers on the balls. Show there are at least two pairs, consisting of one red and one green 
ball, with the same value. Show that this is not necessarily true if there are 13 balls of each 
color. 


. Suppose we have 14 red balls and 14 green balls as in the previous exercise. Show that at 


least two pairs, consisting of one red and one green ball, have the same value. What about 
13 red balls and 14 green balls? 


. Suppose (a1,@2,...,@52) are integers, not necessarily distinct. Show that there are two, a; 


and a; with i € j, such that either a; + a; or a; — a; is divisible by 100. Show that this is 
not necessarily true for integers (a1, @2,...,@51). 


. Suppose five points are chosen from a square whose sides are length s. (The points may be 


either in the interior of the square or on the boundary.) Show that two of the points are at 
most s\/2/2 apart. Find five points so that no two are less than s/2/2 apart. 


. Show that if the edges of A are colored with two colors, there are at least two monochromatic 


triangles. (Two triangles are different if each contains at least one vertex not in the other. 
For example, two red triangles that share an edge count as two triangles.) Color the edges 
of Ke so that there are exactly two monochromatic triangles. 


. Suppose the edges of a Ks are colored with two colors, say red and blue, so that there are 


no monochromatic triangles. Show that the red edges form a cycle, and the blue edges form 
a cycle, each with five edges. (A cycle is a sequence of edges {v1, v2}, {v2,u3},...,{Uk, vr}, 
where all of the v; are distinct. Note that this is true in figure 1.6.1.) 


Show that 8 < R(3,4) < 10. 
Show that R(3,4) = 9. 


1.7 SPERNER’S THEOREM 


The binomial coefficients count the subsets of a given set; the sets themselves are worth 


looking at. First some convenient notation: 


DEFINITION 1.7.1 Let [n| = {1,2,3,...,n}. Then 2!" denotes the set of all subsets 


of [n], and [7] denotes the set of subsets of [n] of size k. 


EXAMPLE 1.7.2 Let n=3. Then 


lo] = 

| = (2, 2}, (3H) 

|| = {2,2}, {1,3} {2,3} 
[3] = (01.2.3) 
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DEFINITION 1.7.3. A chain in 2! is a set of subsets of 2! that are linearly or- 
dered by inclusion. An anti-chain in 2!) is a set of subsets of 2!"! that are pairwise 


incomparable. 


EXAMPLE 1.7.4 In 2!), {9, {1}, {1,2,3}} is a chain, because @ C {1} C {1,2,3}. 
Every [7] is an anti-chain, as is {{1}, {2,3}}. The set {{1}, {1,3}, {2,3}} is neither a 
chain nor an anti-chain. 


k 
largest are the “middle” ones, namely ee /| and ial (which are the same if n is 


Because of theorem 1.3.4 we know that among all anti-chains of the form ed the 


even). Remarkably, these are the largest of all anti-chains, that is, strictly larger than 

every other anti-chain. When n = 3, the anti-chains [?] and [3] are the only anti-chains 

of size 3, and no anti-chain is larger, as you can verify by examining all possibilities. 
Before we prove this, a bit of notation. 


DEFINITION 1.7.5 Ifo: A— Ais a bijection, then o is called a permutation. 


This use of the word permutation is different than our previous usage, but the two are 
closely related. Consider such a function a: [n] — [n]. Since the set A in this case is finite, 
we could in principle list every value of o: 


a(1),a(2),e(3),:<.4o (i): 


This is a list of the numbers {1,...,2} in some order, namely, this is a permutation 
according to our previous usage. We can continue to use the same word for both ideas, 


relying on context or an explicit statement to indicate which we mean. 


THEOREM 1.7.6 (Sperner’s Theorem) The only anti-chains of largest size are ins fl 


and [n/2] : 


Proof. First we show that no anti-chain is larger than these two. We attempt to partition 


2l"] into k = ( [n/a ) chains, that is, to find chains 


Aig G Ais © Aye Gee C Ain 
Azo © Aoi © Aaa ©-++ C Aams 


Ape GC Apa C Apa Ge > C Apa, 
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so that every subset of [n] appears exactly once as one of the A;,;. If we can find such 
a partition, then since no two elements of an anti-chain can be in the same chain, no 
anti-chain can have more than k elements. 


For small values of n this can be done by hand; for n = 3 we have 
OC {FC {1,23 ¢ {1,2,3} 
{2} © {2, 3} 
{3} C {1,3} 


These small cases form the base of an induction. We will prove that any 2'"] can be 
partitioned into such chains with two additional properties: 


1. Each set in a chain contains exactly one element more than the next smallest set 
in the chain. 


2. The sum of the sizes of the smallest and largest element in the chain is n. 


Note that the chains for the case n = 3 have both of these properties. The two properties 
taken together imply that every chain “crosses the middle”, that is, every chain contains 
an element of n/2 if n is even, and an element of both Al and acer | if n is odd. 


Thus, if we succeed in showing that such chain partitions exist, there will be exactly (iny2 ) 
chains. 

For the induction step, we assume that we have partitioned 2!"~") into such chains, 
and construct chains for 2!”!. 

First, for each chain Ajo C Aji C --: C Aim, we form a new chain Ajo C Aji C 
-¥ C Ayan, © Apm, U {nr}. Since | A; o| + Ain, = n= L, |Atol- |Aimn-U {rn} =n; 86 this 


new chain satisfies properties (1) and (2). 
In addition, if m; > 0, we form a new chain Ajo U{n} C Aji U{n} C--- C Aim,—-1U 
{n}. Now 


|Ai,o U {n}| F Ai gat U {n}| i |Aio ae \Aerns= | See | 
= |Ajo/ +14+|Aim,|-—14+1 
=n-l+l=n 


so again properties (1) and (2) are satisfied. 

Because of the first type of chain, all subsets of [n — 1] are contained exactly once in 
the new set of chains. Also, we have added the element n exactly once to every subset of 
[2 — 1], so we have included every subset of [n] containing n exactly once. Thus we have 
produced the desired partition of piel, 

Now we need to show that the only largest anti-chains are ee /| and Fe is 

Suppose that Ai, Ag,..., Am is an anti-chain; then Af, AS,..., AS, is also an anti-chain, 
where A° denotes the complement of A. Thus, if there is an anti-chain that contains some 
A with |A] > [n/2], there is also one containing A‘, and |A‘°| < |n/2|. Suppose that some 
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anti-chain contains a set A with |A| < |n/2|. We next prove that this anti-chain cannot 
be of maximum size. 

Partition 2!) as in the first part of the proof. Suppose that A is a subset of the 
elements of a one or two element chain C’, that is, a chain consisting solely of a set $1 of 
size n/2, if n is even, or of sets S; and Sp of sizes |n/2] and [n/2], with A C S; C So, if 
n is odd. Then no member of C is in the anti-chain. Thus, the largest possible size for an 
anti-chain containing A is (iny2 )) —1. 

If A is not a subset of the elements of such a short chain, we now prove that there 
is another chain partition of 2!"! that does have this property. Note that in the original 
chain partition there must be a chain of length 1 or 2, Ci, consisting of S; and possibly 
Sg; if not, every chain would contain a set of size |n/2| — 1, but there are not enough 
such sets to go around. Suppose then that A = {21,...,2,} and the set 5; in Ci is 
Sy = {21,..-,2q,Yg+is--- yt, whereO<q<kandI>k. 

Let o be the permutation of [n] such that o(@q4i) = Yq+i and o(Yq+i) = Lq4i, for 
1<i<k-—gq, and ca fixes all other elements. Now for U C [n], let U = o(U), and note 
that U C V if and only if U C V. Thus every chain in the original chain partition maps 
to a chain. Since o is a bijection, these new chains also form a partition of 2!"!, with the 
additional properties (1) and (2). By the definition of 7, A C $1, and {51, $2} is a chain, 
say C;. Thus, this new chain partition has the desired property: A is a subset of every 
element of the 1 or 2 element chain Cy, so A is not in an anti-chain of maximum size. 

Finally, we need to show that if n is odd, no anti-chain of maximum size contains 


sets in both Me i and ra |. Suppose there is such an anti-chain, consisting of sets 
AjaiyssgAy a Re , where | = (raya) and B,,..., By, in Fan . The remaining sets 
in ray | are A,,..., Az, and the remaining sets in all are Byii,.-., By. 


Each set B;, 1 <i < k, is contained in exactly [n/2] sets in trail and all must 
be among Aj,...,A,. On average, then, each A;, 1 < i < k, contains [n/2] sets among 
B,,..., By. But each set Aj, 1 < i < k, contains exactly [n/2] sets in hear and so 
each must contain exactly [n/2] of the sets B,,...,B, and none of the sets Byii,..., Br. 

het Ay = Ay, = {igs 0p} and Bat = {Fig Pe ee thei: Les = 
Aj, \{@s+m} and A;,,,, = Bi,, U{ys+m}, for 1 <m < r—s—1. Note that by the preceding 
discussion, |< 44, <6 and. Is g4 = kh Then Ay. . = 4 Mijssey Bay Wess 4 Uri, et 80 
A > By41, a contradiction. Hence there is no such anti-chain. a 


Jr-s = 


Exercises 1.7. 
1. Sperner’s Theorem (1.7.6) tells us that [als with size 20, is the unique largest anti-chain 
for 2'°]. The next largest anti-chains of the form [| are | and eae with size 15. Find a 
maximal anti-chain with size larger than 15 but less than 20. (As usual, maximal here means 
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that the anti-chain cannot be enlarged simply by adding elements. So you may not simply 
use a subset of Ra .) 


1.8 STIRLING NUMBERS 


In exercise 4 in section 1.4, we saw the Stirling numbers of the second kind. Not surpris- 
ingly, there are Stirling numbers of the first kind. Recall that Stirling numbers of the 
second kind are defined as follows: 


DEFINITION 1.8.1 The Stirling number of the second kind, $(n,k) or {7}, is the 
number of partitions of [n] = {1,2,...,n} into exactly k parts, l<k <n. 


Before we define the Stirling numbers of the first kind, we need to revisit permutations. 
As we mentioned in section 1.7, we may think of a permutation of [n] either as a reordering 
of [n] or as a bijection o:[n| > [n]. There are different ways to write permutations when 
thought of as functions. Two typical and useful ways are as a table, and in cycle form. 
Consider this permutation o: [5] — [5]: o(1) = 3, o(2) = 4, o(8) = 5, o(4) = 2, o(5) = 1. 


12345 
34521 


write “o” five times. In cycle form, we write this same permutation as (1,3, 5)(2,4). Here 
(1,3,5) indicates that o(1) = 3, o(3) = 5, and o(5) = 1, whiile (2,4) indicates o(2) = 4 
and o(4) = 2. This permutation has two cycles, a 3-cycle and a 2-cycle. Note that (1,3,5), 


In table form, we write this as ( ), which is somewhat more compact, as we don’t 


(3,5, 1), and (5, 1,3) all mean the same thing. We allow 1-cycles to count as cycles, though 


sometimes we don’t write them explicitly. In some cases, however, it is valuable to write 
eee 
345.3916)" 


If we write this in cycle form as (1,3,5)(2, 4), which is correct, there is no indication that 
the underlying set is really [6]. Writing (1,3, 5)(2,4)(6) makes this clear. We say that this 
permutation has 3 cycles, even though one of them is a trivial 1-cycle. Now we’re ready 


them to force us to remember that they are there. Consider this permutation: ( 


for the next definition. 


DEFINITION 1.8.2 The Stirling number of the first kind, s(n, k), is (—1)"~* times 
the number of permutations of [n] with exactly k cycles. The corresponding unsigned 
Stirling number of the first kind, the number of permutations of [n] with exactly k 


cycles, is |s(n, k)|, sometimes written [7]. Using this notation, s(n, k) = (—1)"~* [7]. 


Note that the use of Ln] conflicts with the use of the same notation in section 1.7; 
there should be no confusion, as we won’t be discussing the two ideas together. 
Some values of [a] are easy to see; ifn > 1, then 


[| =1 || =O ifk>n 


Heo-m fees 
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It is sometimes convenient to say that [3] = 1. These numbers thus form a triangle in the 
obvious way, just as the Stirling numbers of the first kind do. Here are lines 1—5 of the 


triangle: 
1 
0 1 
OG dt JF 
OG 2 8 2 
0 6 11 6 1 


0 24 50 35 10 1 
The first column is not particularly interesting, so often it is eliminated. 
In exercise 4 in section 1.4, we saw that 


Ca a, = 


The unsigned Stirling numbers of the first kind satisfy a similar recurrence. 


THEOREM 1.8.3 [7] =[?7}]+(n-1)-[%"], k2Ln21. 
Proof. The proof is by induction on n; the table above shows that it is true for the first 
few lines. We split the permutations of [n] with k cycles into two types: those in which (n) 
is a l-cycle, and the rest. If (n) is a 1-cycle, then the remaining cycles form a permutation 
of [n — 1] with k — 1 cycles, so there are bea of these. Otherwise, n occurs in a cycle 
of length at least 2, and removing n leaves a permutation of [n — 1] with k cycles. Given 
a permutation o of [n — 1] with k cycles, n can be added to any cycle in any position to 
form a permutation of [n] in which (n) is not a l-cycle. Suppose the lengths of the cycles 
in o are l,,l2,...,1,%. In cycle number 7, n may be added after any of the 1; elements in the 
cycle. Thus, the total number of places that n can be added is 11 +12 +---+l, =n—1, 80 


there are (n —1)- eal permutations of [n] in which (7) is not a 1-cycle. Now the total 


number of permutations of [n] with k cycles is aot +(n-1)- eral as desired. Zz 
COROLLARY 1.8.4 s(n, k) = s(n—1,k —1) — (n—1)s(n—1,k). 7 


The Stirling numbers satisfy two remarkable identities. First a definition: 


DEFINITION 1.8.5 The Kronecker delta 6,,;, is 1 ifn =k and 0 otherwise. 


THEOREM 1.8.6 Forn>0 and k>0, 
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Proof. We prove the first version, by induction on n. The first few values of n are easily 

checked; assume n > 1. Now note that [* = 0, so we may start the sum index 7 at 1. 
When k > n, eae = 0, for 1 < 7 < n, and so the sum is 0. When k = n, the only 

non-zero term occurs when j = n, and is (—1)° [| { a, = 1, so the sum is 1. Now suppose 

k <n. When k = 0, wet = (0 for 7 > 0, so the sum is 0, and we assume now that k > 0. 
We begin by applying the recurrence relations: 


rf) Eber (Gry +o9 YB 


i] 3 
P amie. a 

| 

KH 


Bem Gal Bere-ofF 16) 
= Seu Pot] (rt eefZ Peo fF] 
er eee 
Yer oF] 


Consider the first sum in the last expression: 


seo Soap yoo Baa] 


j=l j=2 
= n—-1 j 
Bor Pe) 
ey ae 
= Opotea— U, 


since k — 1 <n-—1 (or trivially, if k = 1). Thus, we are left with just two sums. 
n eae a fea us _ (eect OT ihe) 
= 9 —1)" -4¥(n-1 
ee aa ee OT aes 
= n—-1] {j = n—1] {j 
=kS\°(-1)?-3-1 | =e Sy 
Se ay en | 


= Kg 2t = (n = 1) Oaks 


Now if k = n —1, this is (n — 1)dn-1,n-1 — (n — 1)bn_1n_1 = 0, while if k < n—1 it is 
Kigoiie == Donate = he Se 1) 0 = 0: a 


A2 
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If we interpret the triangles containing the s(n,k) and S(n,k) as matrices, either 


m xm, by taking the first m rows and columns, or even the infinite matrices containing 


the entire triangles, the sums of the theorem correspond to computing the matrix product 


in both orders. The theorem then says that this product consists of ones on the diagonal 


and zeros elsewhere, so these matrices are inverses. Here is a small example: 


1 0 0 0 0 O lo 0 0 Q 0 10 0 0 0 0 
O J 0 0 0 O 0 1 0 0 0 0 0 10 0 0 0 
? = 1 0 0 O 01 1 0 0 07 =|O0 0 1 0 0 0 
0 2 -3 1 0 0 01 3 1 0 Of |0 0 0 1 0 0 
0 -=6 IT <=6 <1. Od ee & LT 4 0 0 0 0 1 =O 
0 24 -50 35 —-10 1 O 1 15 25 10 1 000 00 1 


Exercises 1.8. 


ee i 


10. 


11. 


Find a simple expression for | : 


Find a simple expression for els 
What is )o7 9 [n]? 
What is 779 s(n, k)? 


Show that «” = []?—3(@—k) = 0", s(n, i)a’, n > 1; x” is called a falling factorial. Find 


a similar identity for x” = ia (c+k); x” is a rising factorial. 
“ nr kon P k-: ; : - 3 
Show that > { k \ x~ = x, n> 1; x is defined in the previous exercise. The previous 
k=0 


exercise shows how to express the falling factorial in terms of powers of x; this exercise shows 
how to express the powers of x in terms of falling factorials. 


. Prove: S(n,k) = » ("; 1) ste—v. 


i=k-1 
n = n—1 a 
Prove: 4 = Ds nian ; lial 
t=k-1 
n—1 1 
Use the previous exercise to prove s(n, k) = (-1)""*""(n-i-1)! ("; Js k—1). 
i=k-1 


We have defined [7] and 17} for n,k > 0. We want to extend the definitions to all integers. 
Without some extra stipulations, there are many ways to do this. Let us suppose that for 
n #~ 0 we want | [°] ee { a 0, and we want the recurrence relations of 
equation 1.8.1 and in theorem 1.8.3 to be true. Show that under these conditions there is a 


unique way to extend the definitions to all integers, and that when this is done, { ct = re 
for all integers n and k. Thus, the extended table of values for either el or { att will contain 
all the values of both [7] and {7}. 

Under the assumptions that s(n,0) = s(0,n) = 0 for n £ 0, and s(n,k) = s(n —1,k—1) — 
(n—1)s(n—1,k), extend the table for s(n, k) to all integers, and find a connection to S(n, k) 
similar to that in the previous problem. 
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12. Prove corollary 1.8.4. 


13. Prove the remaining part of theorem 1.8.6. 


2 


Inclusion-Exclusion 


2.1 THE INCLUSION-EXCLUSION FORMULA 


Let’s return to a problem we have mentioned but not solved: 


EXAMPLE 2.1.1 How many submultisets of the multiset {2-a,4-b,3-c} have size 7? 

We recast the problem: this is the number of solutions to 7; + #2 + #3 = 7 with 
O< a4, < 2,0 < 42 < 4,0 < 23 < 3. We know that the number of solutions in non- 
a) = Cy so this is an overcount, since we count solutions that 
do not meet the upper bound restrictions. For example, this includes some solutions with 


negative integers is ( 


x, > 3; how many of these are there? This is a problem we can solve: it is the number 
of solutions to 71 + 72 + #3 = 7 with 3 < x, 0 < x2, 0 < x3. This is the same as the 
number of non-negative solutions of y; + y2 + y3 = 7-3 = 4, or (22) = (5). Thus, 
(5) _ (5) corrects this overcount. If we likewise correct for the overcounting of solutions 
with x2 > 5 and x3 > 4, we get (5) — ©) — (5) — 6). Is this correct? Not necessarily, 
because we now have a potential undercount: we have twice subtracted 1 for a solution in 
which both 2; > 3 and x2 > 5, when we should have subtracted just 1. However, by good 
fortune, there are no such solutions, since 3+ 5 > 7. But the same applies to the other 
pairs of variables: How many solutions have x7; > 3 and x3 > 4? It’s easy to see there is 
only one such solution, namely 3+ 0+4= /7. Finally, there are no solutions with x2 > 5 
and x3 > 4, so the corrected count is now (3) — (3) _ (5) — @ +1. This does not take into 
account any solutions in which x; > 3, x2 > 5, and x3 > 4, but there are none of these, so 


the actual count is 


()-G) - (2) - G) +1 = 88-15-6109 +1 = 6. 
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This is small enough that it is not hard to verify by listing all the solutions. 


So we solved this problem, but it is apparent that it could have been much worse, 
if the number of variables were larger and there were many complicated overcounts and 
undercounts. Remarkably, it is possible to streamline this sort of argument; it will still, 
often, be quite messy, but the reasoning will be simpler. 

Let’s start by rephrasing the example. Let S be the set of all non-negative solutions 
to 71 +. 22+ 23 = 7, let A, be all solutions with x, > 3, Ag all solutions with x2 > 5, and 
Ag all solutions with x3 > 4. We want to know the size of A[ M A511 AS, the solutions 
for which it is not true that 7; > 3 and not true that ry > 5 and not true that x3 > 4. 
Examining our solution, we see that the final count is 


|S| — |Ai| — |Ag| — |A3] + |A1 N A2| + |A1 NM A3] + |A2n As] — |A1N Ag As3| 
= 36 -15-6—-10+0+1+0-—0. 


This pattern is completely general: 
THEOREM 2.1.2. The inclusion-exclusion formula If A; C S for 1 <i<n then 
JAS ---N AP] = [S| —|Ai] —--- — |An| + ]A1M Ag] +----—|A1N A2N As] —---, 


or more compactly: 


n n k 
1) 48] = 151+ 5-1)" S017 Aas, 
i=1 k=1 j=l 


where the internal sum is over all subsets {i1,i2,...,7,} of {1,2,...,n}. Alternately we 
may write 

(Ane SCO) 4yl- 

i=l IC[n] jel 


(Note that () A; =S8.) 
jE 
Proof. We need to show that each element of ();"_, Af is counted once by the right hand 
side, and every other element of S is counted zero times. The first of these is easy: if 
x € {)j_, Af then for every i, x ¢ A;, so x is in none of the sets involving the A; on the 
right hand side, and so x is counted, once, by the term |S]. 
Now suppose x ¢ {);_, Af. On the right hand side, x is counted once by the term 


|S|. For some values 71, i2,...,i,%, v € Aj,,, 1<m<k, and z is not in the remaining sets 
A;. Then x is counted zero times by any term involving an A; with i ¢ {t1,72,...,ix}, 
and is counted once, positively or negatively, by each term involving only A;,, Ai,,..., Ai,.- 


There are k terms of the form —|A;,,|, which count x a total of —k times. There are (5) 


2.1 The Inclusion-Exclusion Formula AT 


terms of the form |A;, 9 A;,,|, counting x a total of CG) times. Continuing in this way, we 
see that the final count for x on the right hand side is 


(Qe) 


(!) 


i=0 


or more compactly 


We know that this alternating sum of binomial coefficients is zero, so x is counted zero 
times, as desired. (See equation 1.3.1.) a 


An alternate form of the inclusion exclusion formula is sometimes useful. 


COROLLARY 2.1.3 If A; C S for 1 <i<nthen 


n 


n k 
Ul = 0 O17) Ail 


k=1 
where the internal sum is over all subsets {i1,72,..., ix} of {1,2,...,m}. 


Proof. Since (Uj, 4i)° = Ne Af, 
IL Ail = 151-1) 42 
=i i=4. 
k 
= |$|- (81+ 30-1)" 7) 1) Ag) 
k=1 j=l 


nm 


k 
= (-1) $0(-1)* $0] f) Ai, 
k=1 j=l 


n 


k 
= eae S-| (| A;,|- 


k=1 


Since the right hand side of the inclusion-exclusion formula consists of 2” terms to 
be added, it can still be quite tedious. In some nice cases, all intersections of the same 
number of sets have the same size. Since there are cs possible intersections consisting of 
k sets, the formula becomes 


I) Af = ISI + ne (me (2.1.1) 
i=1 k=1 


where my, is the size of an intersection of k of the sets. 
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EXAMPLE 2.1.4 Find the number of solutions to 71 +%2+23+24 = 25,0 <2; < 10. 
Let A; be the solutions of x1 + v2 + 43 + %4 = 25 with x; > 11. The number of solutions 


with x; > 0 for all zis ere = i) Also |A;| = oe), and |A;N.A;| = ‘eur There 


are no solutions with 3 or 4 of the variables larger than 10. Hence the number of solutions 


OP) (VE) OCS) =o 


Exercises 2.1. 
1. List all 6 solutions to the restricted equation in example 2.1.1, and list the corresponding 6 
submultisets. 


2. Find the number of integer solutions to 71 + 72 + %3 + a4 = 25,1 <1 < 6,2 < 22 < 8, 
O<a3<8,5<a44< 9. 


3. Find the number of submultisets of {25 - a, 25 - b,25-c,25- d} of size 80. 


4. Recall that 17} is a Stirling number of the second kind (definition 1.8.1). Prove that for 
n>k>0, 


Do n = 0 as a special case, then use inclusion-exclusion for the rest. You may assume, by 
convention, that 0° = 1. 


2.2 FORBIDDEN POSITION PERMUTATIONS 


Suppose we shuffle a deck of cards; what is the probability that no card is in its original 
location? More generally, how many permutations of [n] = {1,2,3,...,n} have none of 
the integers in their “correct” locations? That is, 1 is not first, 2 is not second, and so on. 
Such a permutation is called a derangement of [n]. 

Let S be the set of all permutations of [n] and A; be the permutations of [n] in which 
i is in the correct place. Then we want to know |();_, A$]. 

For any 7, |A;| = (n — 1)!: once 2 is fixed in position i, the remaining n — 1 integers 
can be placed in any locations. 

What about |A;7 A,|? If both ¢ and j are in the correct position, the remaining n — 2 
integers can be placed anywhere, so |A;MA;| = (n — 2)!. 


2.2 Forbidden Position Permutations A9 


In the same way, we see that |A;,A;,9---N Aj, | = (n—k)!. Thus, by the inclusion- 
exclusion formula, in the form of equation 2.1.1, 


441 =Isi+ i-n8(Z) a — 
i=1 k=1 


_ k 
=n+ 5 (-1) in mi” —k)! 
k=1 
_ = pn 
=n! So(-)) A 
k=1 
ee nl Sot 
k=1 kl 
= i 
— 7! _7)k 
n @ + 2| 1) =) 
aaa a 
k=0 
The last sum should look familiar: 
x = 1 k 
e = Ss kl” 
k=0 
Substituting « = —1 gives 
| 
= k 
oe 5 a) 
k=0 


The probability of getting a derangement by chance is then 


1c 1 4-1 
ie > (-1) a >-(-1) El 
k=0 


k=0 


and when n is bigger than 6, this is quite close to 
e~| = 0.3679. 


So in the case of a deck of cards, the probability of a derangement is about 37%. 


Let D, = n! ar eas These derangement numbers have some interesting 


properties. First, note that when n = 0, we have Do = o!(—1)°= = 1: 


of the empty set” doesn’t really make sense, but it is useful to adopt the convention that 
Dy =1. 


“Derangements 
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The derangements of [n] may be produced as follows: For each 7 € {2,3,...,n}, put i 
in position 1 and 1 in position 7. Then permute the numbers {2,3,...,i—1,i+1,...n} 
in all possible ways so that none of these n — 2 numbers is in the correct place. There are 
D,~2 ways to do this. Then, keeping 1 in position 7, derange the numbers {7, 2,3,...,i— 
1,i+1,...n}, with the “correct” position of i now considered to be position 1. There are 
Dy-1 ways to do this. Thus, D, = (n — 1)(Dn-1 + Dn-2). Starting with Do = 1 and 
D, = 0, this gives Dz = (1)(0+ 1) = 1 and D3 = (2)(1 + 0) = 2, both of which are easy 
to check directly. 

We explore this recurrence relation a bit: 


Dy =nDpn—-1 — Dn-1 + (n — 1)Dn-2 (x) 
= nDy_1 — (n — 2)(Dn_2 + Dn_3) + (n — 1)Dn_2 
= nDy_1 — (n — 2)Dn_2 — (n — 2)Dyn_3 + (n—1)Dn_2 
=n Dy Dag = (=D) Dia (+) 
= nD 3) (Dasa + Dia) = 2) Dies 
=nD,_-1 + (n — 3)Dn_3 + (n — 3) Dpn_a — (n — 2)Dp_s 
= nDy_-1 — Dn-3 + (n — 3)Dn_4 (x) 
=nDn—-1 — (n— 4)(Dn_-4 + Dn_s) + (n — 3)Dy_a 
= nDy_1 — (n—4)Dn_a — (n — 4)Dy_5 + (n — 3)Dpn_a 
A Dia Dyce SA) Dice, (+) 


It appears from the starred lines that the pattern here is that 
Dn = NDn—1 + (—1)"Dn_—e + (—1)**1(n — k)Dy_e-1. 
If this continues, we should get to 
Dy = nDy—1 + (-1)"-? De + (-1)"71(2) D1. 
Since Dy = 1 and D,; = 0, this would give 
Dy =nDy-i + (-1)”, 


since (—1)" = (—1)"~?. Indeed this is true, and can be proved by induction. This gives a 
somewhat simpler recurrence relation, making it quite easy to compute D,,. 


There are many similar problems. 
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EXAMPLE 2.2.1 How many permutations of [n] contain no instance of i followed by 
ie ome is 
By a similar use of the inclusion-exclusion formula, it turns out that this is 


= 1 = = ae, 
Qn =n! ee: +(n—1)! dy Gor 


Note that the limits on the two sums are not identical. 


Exercises 2.2. 


1. Prove that Dn = nD,y—1 + (—1)” when n > 1, by induction on n. 


. Prove that D, is even if and only if n is odd. 


2 
3. Provide the missing details for example 2.2.1. What is lim Qn» 
4 


. Find the number of permutations of 1,2,...,8 that have no odd number in the correct 
position. 
5. Find the number of permutations of 1,2,...,8 that have at least one odd number in the 


correct position. 


6. How many permutations of [n] have exactly k numbers in their correct positions? 


n! = 3 (;) ee 


k=0 


7. Give a combinatorial proof that 


8. A small merry-go-round has 8 seats occupied by 8 children. In how many ways can the 
children change places so that no child sits behind the same child as on the first ride? The 
seats do not matter, only the relative positions of the children. 


9. Repeat the previous problem with n instead of 8. 


10. On the way into a party everyone checks a coat and a bag at the door. On the way out, the 
attendant hands out coats and bags randomly. In how many ways can this be done if 


(a) No one gets either their own coat or their own bag? 
(b) One may get one’s own coat, or bag, but not both. 


11. Suppose n people are seated in m > n chairs in a room. At some point there is a break, and 
everyone leaves the room. When they return, in how many ways can they be seated so that 
no person occupies the same chair as before the break? 


3 


Generating Functions 


As we have seen, a typical counting problem includes one or more parameters, which of 
course show up in the solutions, such as Ce P(n,k), or the number of derangements of 


[n]. Also recall that 
(@@+1)"=>> Ge 


k=0 


This provides the values (2) as coefficients of the Maclaurin expansion of a function. This 


turns out to be a useful idea. 


DEFINITION 3.0.1 f(x) is a generating function for the sequence ao, a1, d@2,... if 


fe) = iS aja’. 
i=0 


Sometimes a generating function can be used to find a formula for its coefficients, but 
if not, it gives a way to generate them. Generating functions can also be useful in proving 
facts about the coefficients. 


3.1 NEWTON’S BINOMIAL THEOREM 


Recall that 


Ge n! n(n —1)(n—2)---(n-—k+1) 
= Foo kl 
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The expression on the right makes sense even if n is not a non-negative integer, so long as 
k is a non-negative integer, and we therefore define 


(") _ (r= = 2) (r- b+) 


k k) 


when r is a real number. For example, 


(172) _ GAewAeanc/e 5 a) (=2)(-3)(-4) _ 


4 Al ~ 128 


These generalized binomial coefficients share some important properties of the usual 
binomial coefficients, most notably that 


t ral r=1 
= : Ee 
(i) = aa) a 
THEOREM 3.1.1 Newton’s Binomial Theorem For any real number r that is 


not a non-negative integer, 
CO 
r . 
x+1)\'= _)a’ 
@+1r => (1) 


1=0 


Then remarkably: 


when —1 <a<l. 


Proof. It is not hard to see that the series is the Maclaurin series for (2 + 1)", and that 
the series converges when —1 < x < 1. It is rather more difficult to prove that the series is 
equal to (2 +1)"; the proof may be found in many introductory real analysis books. = 


EXAMPLE 3.1.2 Expand the function (1 — «)~” when n is a positive integer. 
We first consider (2 + 1)~”; we can simplify the binomial coefficients: 


(—n)(—n — 1)(—n — 2)---(—n —i 4+ 1) (4) ee ae!) 


i! i! 
,(n+i-1)! 
i! (n — 1)! 


Thus 
= (n+i-1\ , wofnti-1 ; 
1" = = — =a)". 
ery = ey t= CPT ee 
i=0 1=0 
Now replacing x by —2 gives 


i 3 om ‘)e 


nt+i-1 


+7"), the number of submultisets of {00 - 


So (1—2)~” is the generating function for ( 


1,00: 2,...,00-n} of size i. 
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In many cases it is possible to directly construct the generating function whose coeffi- 
cients solve a counting problem. 


EXAMPLE 3.1.3 Find the number of solutions to 7, + #2 + 73 + 44 = 17, where 
O< a4, <2,0<% <5,0<%3<5,2 <4, <6. 
We can of course solve this problem using the inclusion-exclusion formula, but we use 


generating functions. Consider the function 
(te4e7\(lt+e4e? 407% 40° 4°) (ltoete? +e? 4+2*40°)(2? 40° +0° 40° 42°). 


We can multiply this out by choosing one term from each factor in all possible ways. If 
we then collect like terms, the coefficient of x* will be the number of ways to choose one 
term from each factor so that the exponents of the terms add up to k. This is precisely the 
number of solutions to 71 + %2 + 73 +%4 = k, where 0 < 271 < 2,0< 22 <5,0< 23 <5, 
2< 24 <6. Thus, the answer to the problem is the coefficient of z'’. With the help of a 
computer algebra system we get 


d+e4+e27)(l4+e4¢e? +22 42°42?) (2? 42% 4+2¢4+2°4+2°) 
= 71% 4 do)? + 10018 + 192) + 31a14 + 4521? + 58x)? + 67214 + 70219 
+ 672° + 5808 + 45a? + 312° + 192° + 1024 + 4x? + 2?, 


so the answer is 4. 


EXAMPLE 3.1.4 _ Find the generating function for the number of solutions to 7, + 
%2+%3+%44 =k, where 0 <2, <w,0< 242 <5,0< 9473 <5,2<944 <6. 
This is just like the previous example except that x; is not bounded above. The 


generating function is thus 
f(e)=A+ata?+--)A+et+a?+e%+a% tee? +e% +a°+2°4+2°) 
=(l-a) (tata? teitatte Pa? teitaettae? + 2°) 
tata? +a2+at+e°)? (22+ 23 +24+27° +28) 


L=2 
Note that (1 — x)! = (1+a2+2?+---) is the familiar geometric series from calculus; 
alternately, we could use example 3.1.2. Unlike the function in the previous example, this 
function has an infinite expansion: 
f(a) = a? + 42° + 100* + 200° + 3508 4+ 55a + 7828 
+ 1022° + 125219 + 14527" + 16021? + 1702** + 1762"4 
+179x'> + 180x'® + 1802'” + 180218 + 180x19 + 180279 +--+. 


You can see how to do this in Sage. 
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EXAMPLE 3.1.5 Find a generating function for the number of submultisets of {00 - 
a, 0o-b, co-c} in which there are an odd number of as, an even number of bs, and any number 


of cs. As we have seen, this is the same as the number of solutions to 71 +272 +273 =n in 


which x, is odd, x2 is even, and #3 is unrestricted. The generating function is therefore 


(rt+a%+a°+---)\Lta*t+e*t-.-)l+e+27?+2°+---) 


= x(1 + (2?) + (x?)? + (27)? +---)(1 + (2?) + (27)? + (2?)P +--+) 
ig) =a) 


l-«z 


Exercises 3.1. 


For some of these exercises, you may want to use the sage applet above, in example 3.1.4, or your 
favorite computer algebra system. 


1. 


Prove that (;) = (;-3) + (";')- 


2. Show that the Maclaurin series for (x + 1)” is P25 ({)2’. 
3. 
4 


. Use a generating function to find the number of solutions to 71 + x2 + 73 + 24 = 14, where 


Concerning example 3.1.4, show that all coefficients beginning with x'® are 180. 


O<a1<3,2<472<5,0< 93 <5,4<44<6. 


. Find the generating function for the number of solutions to x1 + %2 + 43 + a4 = k, where 


0<21 <0,3<422<0,2<23<5,1<a4<5. 


. Find a generating function for the number of non-negative integer solutions to 3x+2y+7z = 


n. 


. Suppose we have a large supply of red, white, and blue balloons. How many different bunches 


of 10 balloons are there, if each bunch must have at least one balloon of each color and the 
number of white balloons must be even? 


. Use generating functions to show that every positive integer can be written in exactly one 


way as a sum of distinct powers of 2. 


. Suppose we have a large supply of blue and green candles, and one gold candle. How many 


collections of n candles are there in which the number of blue candles is even, the number of 
green candles is any number, and the number of gold candles is at most one? 


3.2 EXPONENTIAL GENERATING FUNCTIONS 


There are other ways that a function might be said to generate a sequence, other than as 


what we have called a generating function. For example, 
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is the generating function for the sequence 1,1 But if we write the sum as 


1 
Te 
0° n 

x 
e= ) Lr 
nm 
n=0 


considering the n! to be part of the expression 2"/n!, we might think of this same function 
as generating the sequence 1,1,1,..., interpreting 1 as the coefficient of 2"/n!. This is not 
a very interesting sequence, of course, but this idea can often prove fruitful. If 


CO gn 
=o 
n=0 / 
we say that f(x) is the exponential generating function for ao, a1, a2,.... 


EXAMPLE 3.2.1 Find an exponential generating function for the number of permu- 
tations with repetition of length n of the set {a,b,c}, in which there are an odd number 
of as, an even number of bs, and any number of cs. 

For a fixed n and fixed numbers of the letters, we already know how to do this. For 


such permutations. Now consider 


example, if we have 3 as, 4 bs, and 2 cs, there are G " >) 


the following function: 
oo g2itl 


dX cEayD> mr 


I= 


What is the coefficient of x9/9! in this product? One way to get an x? term is 


po ae ae? 9! x? = Or ag? 
31 4! 2! 3l4r2ror (3.42) oF 
That is, this one term counts the number of permutations in which there are 3 as, 4 bs, 
and 2 cs. The ultimate coefficient of 7°/9! will be the sum of many such terms, counting 


the contributions of all possible choices of an odd number of as, an even number of bs, 
and any number of cs. 


Now we notice that S- < = e”, and that the other two sums are closely related to 
i! 
i=0 
this. A little thought leads to 


“a S(- “xt + (-2 
eee aerial 


1=0 1=0 1=0 


a 


Now 2’ + (—2)' is 22° when 7 is even, and 0 when i is odd. Thus 


Le De2% 

ew+e7%= ) — 
1? 

= (27)! 
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so that re 
ss tt = et ig? 
= (27)! 2 
A similar manipulation shows that 
ad grit _@ —et 
Dei 21 +1)! 2 
Thus, the generating function we seek is 
®_ p—k p@ —2£ 1 1 
€ € e+e ez = (e* —e*)(e +e*) = (eae) 


a 2 4 


Notice the similarity to example 3.1.5. 


Exercises 3.2. 


1. Find the coefficient of x°/9! in the function of example 3.2.1. You may use Sage or a similar 
program. 


2. Find an exponential generating function for the number of permutations with repetition of 
length n of the set {a,b,c}, in which there are an odd number of as, an even number of bs, 
and an even number of cs. 

3. Find an exponential generating function for the number of permutations with repetition of 
length n of the set {a,b,c}, in which the number of as is even and at least 2, the number of 
bs is even and at most 6, and the number of cs is at least 3. 

4. In how many ways can we paint the 10 rooms of a hotel if at most three can be painted red, 
at most 2 painted green, at most 1 painted white, and any number can be painted blue or 
orange? (The rooms are different, so order matters.) 


5. Recall from section 1.4 that the Bell numbers B, count all of the partitions of {1,2,...,n}. 


Let f(x = 5 B,, - —, and note that 
n=0 
hs = a = a < ee x” 
#) = So Bay = So Bans = (So (7) Bee) 
n=1 n=0 n=0 \k=0 


using the recurrence relation 1.4.1 for By+1 from section 1.4. Now it is possible to write this 
as a product of two infinite series: 


= (Ea-2) (Fa) sot 


Find an expression for a, that makes this true, which will tell you what g(x) is, then solve 
the differential equation for f(x), the exponential generating function for the Bell numbers. 


From section 1.4, the first few Bell numbers are 1, 1, 2, 5, 15, 52, 203, 877, 4140, 21147, 
115975, 678570, 4213597, 27644437. You can use Sage to check your answer. 
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3.3. PARTITIONS OF INTEGERS 


DEFINITION 3.3.1 A partition of a positive integer n is a multiset of positive integers 


that sum to n. We denote the number of partitions of n by pp. 


Typically a partition is written as a sum, not explicitly as a multiset. Using the usual 
convention that an empty sum is 0, we say that po = 1. 


EXAMPLE 3.3.2 The partitions of 5 are 
5 
4+1 
3+2 
3+1+1 
22d 
2+1+141 
1+14+1+1+4+41. 


Thus ps = 7. 


There is no simple formula for p,, but it is not hard to find a generating function 
for them. As with some previous examples, we seek a product of factors so that when 
the factors are multiplied out, the coefficient of 7” is p,. We would like each x” term 
to represent a single partition, before like terms are collected. A partition is uniquely 
described by the number of 1s, number of 2s, and so on, that is, by the repetition numbers 
of the multiset. We devote one factor to each integer: 


(lta+a*+a°%t---)(Lta?+a*4+a°4---)---(lta% +a + 2°% $...).-. 


| 
= 

&, 
ox 


When this product is expanded, we pick one term from each factor in all possible ways, with 
the further condition that we only pick a finite number of “non-1” terms. For example, 
if we pick x? from the first factor, 2° from the third factor, x!° from the fifth factor, 
and 1s from all other factors, we get x«?!. In the context of the product, this represents 
3-1+1-3+43-5, corresponding to the partition 1+1+1+3-+4+5+5-+5, that is, three 


1s, one 3, and three 5s. Each factor is a geometric series; the kth factor is 


1 


ae ky2 Ae ae ae 
+a" + (a")* + (a")? + Tak 


so the generating function can be written 


ee ot 
\D Gere 


k=1 
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Note that if we are interested in some particular p,, we do not need the entire infinite 
product, or even any complete factor, since no partition of n can use any integer greater 
than n, and also cannot use more than n/k copies of k. 


EXAMPLE 3.3.3 _ Find pg. 
We expand 


(I+2+a° +25 +2*+2°+2% +27 +2°)(14+0? +2442 +2%)(1+2°+2°) 
(eee ae ee rae) (aa eae) 
a eg 0g Se bat ae? BAe ba 20g awn, 


so pg = 22. Note that all of the coefficients prior to this are also correct, but the following 


coefficients are not necessarily the corresponding partition numbers. 


Partitions of integers have some interesting properties. Let pa(n) be the number of 
partitions of n into distinct parts; let p(n) be the number of partitions into odd parts. 


EXAMPLE 3.3.4 For n = 6, the partitions into distinct parts are 
6,54+1,44+2,34+241, 
so pq(6) = 4, and the partitions into odd parts are 


pe ee ee ee se ee re ee oe ee 


SO po(6) = 4. 


In fact, for every n, pa(n) = po(n), and we can see this by manipulating generating 
functions. The generating function for pa(n) is 


fa(x) =(14+2)(1+a?)(1+2%)---=][(+2"). 
i=1 

The generating function for p(n) is 

fo(x) = (ee oe eee ee ee ee | 

e ~~ =] 1 — 72itl 

i=0 
We can write 
_1l-a? 1-24 1-2° 


fal) = f6 Pa Sa 


and notice that every numerator is eventually canceled by a denominator, leaving only the 
denominators containing odd powers of x, so fy(x) = fo(z). 
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We can also use a recurrence relation to find the partition numbers, though in a 
somewhat less direct way than the binomial coefficients or the Bell numbers. Let pz(n) be 
the number of partitions of n into exactly k parts. We will find a recurrence relation to 
compute the p;,(n), and then 


Pn = iS Pk (n). 
k=1 


Now consider the partitions of n into k parts. Some of these partitions contain no 1s, like 
3+3+4-+6, a partition of 16 into 4 parts. Subtracting 1 from each part, we get a partition 
of n — k into k parts; for the example, this is 2+ 2+3+5. The remaining partitions of 
n into k parts contain a 1. If we remove the 1, we are left with a partition of n — 1 into 
k — 1 parts. This gives us a 1-1 correspondence between the partitions of n into k parts, 
and the partitions of n — k into k parts together with the partitions of n — 1 into k — 1 
parts, so pe(n) = p(n — k) + pp_i(n — 1). 

Using this recurrence we can build a triangle containing the p;,(n), and the row sums of 
this triangle give the partition numbers. For all n, p;(n) = 1, which gives the first column 
of the triangle, after which the recurrence applies. Also, note that p,(n) = 0 when k > n 
and we let p,(0) = 0; these are needed in some cases to compute the pz(n — k) term of 
the recurrence. Here are the first few rows of the triangle; at the left are the row numbers, 
and at the right are the row sums, that is, the partition numbers. For the last row, each 
entry is the sum of the like-colored numbers in the previous rows. Note that beginning 
with p4(7) = 3 in the last row, px(7) = pr_-1(6), as pe(7 — k) = 0. 


1 1 0 1 
2 1 1 0 2 
3 11 éi1£ +0 3 
41211 5 
5 1 22 1 1 7 
6 i335 2tH1i1 uat| 
1 Io. 44 2 1.1. ts 


Yet another sometimes useful way to think of a partition is with a Ferrers diagram. 
Each integer in the partition is represented by a row of dots, and the rows are ordered 
from longest on the top to shortest at the bottom. For example, the partition 3+3+4+5 
would be represented by 


The conjugate of a partition is the one corresponding to the Ferrers diagram produced 
by flipping the diagram for the original partition across the main diagonal, thus turning 
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rows into columns and vice versa. For the diagram above, the conjugate is 


with corresponding partition 1+2+4-+4+44. This concept can occasionally make facts 
about partitions easier to see than otherwise. Here is a classic example: the number of 
partitions of n with largest part k is the same as the number of partitions into k parts, 
p(n). The action of conjugation takes every partition of one type into a partition of the 
other: the conjugate of a partition into k parts is a partition with largest part k and vice 
versa. This establishes a 1—1 correspondence between partitions into k parts and partitions 
with largest part k. 


Exercises 3.3. 


1. Use generating functions to find pis. 


2. Find the generating function for the number of partitions of an integer into distinct odd 
parts. Find the number of such partitions of 20. 


3. Find the generating function for the number of partitions of an integer into distinct even 
parts. Find the number of such partitions of 30. 


4. Find the number of partitions of 25 into odd parts. 


5. Find the generating function for the number of partitions of an integer into k parts; that is, 
the coefficient of x” is the number of partitions of n into k parts. 


6. Complete row 8 of the table for the px(n), and verify that the row sum is 22, as we saw in 
example 3.3.3. 


7. A partition of n is self-conjugate if its Ferrers diagram is symmetric around the main diagonal, 
so that its conjugate is itself. Show that the number of self-conjugate partitions of n is equal 
to the number of partitions of n into distinct odd parts. 


3.4 RECURRENCE RELATIONS 


A recurrence relation defines a sequence {a;}°2, by expressing a typical term a, in 
terms of earlier terms, a; for 7 < n. For example, the famous Fibonacci sequence is defined 
by 

Fo =0,F, = 1, Fy, = Fn-i+ Fr-2. 


Note that some initial values must be specified for the recurrence relation to define a unique 
sequence. 
The starting index for the sequence need not be zero if it doesn’t make sense or some 


other starting index is more convenient. We saw two recurrence relations for the number 
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of derangements of [n]: 
Dy = 0, Dy _ nDn-1 + (—1)”. 


and 
Dy = 0, De = 1 Diy = (n = 1)(Dn-1 + DD p23), 


To “solve” a recurrence relation means to find a formula for a,. There are a variety 
of methods for solving recurrence relations, with various advantages and disadvantages in 
particular cases. One method that works for some recurrence relations involves generating 
functions. The idea is simple, if the execution is not always: Let 


fle) = ra’, 
1=0 


that is, let f(a) be the generating function for {a;}92,. We now try to manipulate f(x), 
using the recurrence relation, until we can solve for f(x) explicitly. Finally, we hope that 
we can find a formula for the coefficients from the formula for f(z). 


EXAMPLE 3.4.1 Solve Fo = 0, Fy = 1, Fy = Fn-it+ Fn-2. 
Let 


fio S° Fyx' 
i=0 
and note that 


ney So Ratt = . F,_y2°. 
i=0 i=1 


To get the second sum we have simply “re-indexed” so that the index value gives the 
exponent on 2, just as in the series for f(a). Likewise, 


x’ f(x) = > Ratt? = ie 
i=0 i=2 


In somewhat more suggestive form, we have 


f(z) = a2 + For? + Fyx? + Fyrt +--- 
ey Ce) g? + For? + F3x4 
0? fe) = tet che, 


- 


and combining the three equations we get 


f(x) —af(x) —27 f(z) =a 4+ (Po—1)2? + (FR -—- -1)2? +(h- Fe — Fz +. 
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or in more compact form 
f(x) —xf(x) — 2? f(x) = S- Fya' — a Fy_y2' — S- Fy_ox" 
i=0 i=1 i=2 


x+ SOF — Fy, — Fy_2)x* 


i=2 
=ar+ » 0-2? 
i=2 
— Z, 
recalling that Fo = 0 and Fi = 1. Now 
Bi —x 


7 fa poge peed 


f (2) 


If we can find an explicit representation for the series for this function, we will have solved 
the recurrence relation. Here is where things could go wrong, but in this case it works out. 
Let a and b be the roots of x? + 2 — 1; using the quadratic formula, we get 


-1+v5 ee a 


2 2 


Borrowing a technique from calculus, we write 


—2x 7 A B 
geigok! eee, pe bh 


Solving for A and B gives 
fe ED pices 
2/5 2/5 


—£ A, A Be 
e+e-1° al=az/a b1—x/b 


Then 


From calculus we know that 


it 1 it 
age a ap 


Finally, this means the coefficient of x’ in the series for f(z) is 


Aaya) — Fapy. 


Recs 
a 
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Simplifying gives 
Ee = 


1 = 1 Ess) 
v5\ 2 V5\ 2 7° 

Here’s an interesting feature of this expression: since |(1 — /5)/2| < 1, the limit of 
((1 — V5) /2)* as i goes to infinity is 0. So when 3 is large enough, 


F;, = round (3 )) ) 


that is, the first term rounded to the nearest integer. As it turns out, this is true starting 
with 7 = 0. 
You can see how to do the entire solution in Sage. 


We can also use this expression for F;, to compute limp... Fr /Fn—1- 


ff tta/a Af tes | 
an, . Jal 3 ) J 2 ) 1+ V5 
lim — lim n-1 n—1 = . 
n—rco Fy moo 71 (434) 1 (44) 2 


/5 2 4/5 2 


This is the so-called “golden ratio”. 


Exercises 3.4. 


1. Find the generating function for the solutions to hn = 4hn—1 — 3hn—2, ho = 2, hi = 5, and 
use it to find a formula for hy. 


2. Find the generating function for the solutions to hn = 3hn—1+4hn—2, ho = hi = 1, and use 
it to find a formula for hn. 


3. Find the generating function for the solutions to hn = 2hn—1 +3”, ho = 0, and use it to find 
a formula for hp. 


4. Find the generating function for the solutions to hn = 4hn—2, ho = 0, hi = 1, and use it to 
find a formula for h,. (It is easy to discover this formula directly; the point here is to see 
that the generating function approach gives the correct answer.) 


5. Find the generating function for the solutions to hn = hn—-1+hn—2, ho = 1, hi = 3, and use 
it to find a formula for hy. 

6. Find the generating function for the solutions to hn = 9hn—1 — 26hn—2 + 24hn-3, ho = 0, 
hy = 1, ho = —1, and use it to find a formula for hy. 

7. Find the generating function for the solutions to hn = 3hn-1 + 4hn-2, ho = 0, hi = 1, and 
use it to find a formula for hy. 

8. Find a recursion for the number of ways to place flags on an n foot pole, where we have red 
flags that are 2 feet high, blue flags that are 1 foot high, and yellow flags that are 1 foot 
high; the heights of the flags must add up to n. Solve the recursion. 

9. In Fibonacci’s original problem, a farmer started with one (newborn) pair of rabbits at month 


0. After each pair of rabbits was one month old, they produced another pair each month 
in perpetuity. Thus, after 1 month, he had the original pair, after two months 2 pairs, 
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three months, 3 pairs, four months, 5 pairs, etc. The number of pairs of rabbits satisfies 
hn = hn—-1 + hn—2, ho = hi = 1. (Note that this is slightly different than our definition, in 
which ho = 0.) 

Suppose instead that each mature pair gives birth to two pairs of rabbits. The sequence 
for the number of pairs of rabbits now starts out ho = 1, hi = 1, he = 3, hg = 5, ha = 11. 
Set up and solve a recurrence relation for the number of pairs of rabbits. Show also that the 
sequence statisfies hn = 2hn—1 + (—1)”. 


1 14v5\" 1 i-v5\" 
ae a ie 1+V5 


n—1 n—1 
n+co Fyn 1 noo 1 14/5 1 1-VB 2 
V5 2 V5 2 


10. Explain why 


3.5 CATALAN NUMBERS 


A rooted binary tree is a type of graph that is particularly of interest in some areas 
of computer science. A typical rooted binary tree is shown in figure 3.5.1. The root is 
the topmost vertex. The vertices below a vertex and connected to it by an edge are the 
children of the vertex. It is a binary tree because all vertices have 0, 1, or 2 children. How 
many different rooted binary trees are there with n vertices? 


Figure 3.5.1 <A rooted binary tree. 


Let us denote this number by C,,; these are the Catalan numbers. For convenience, 
we allow a rooted binary tree to be empty, and let Co = 1. Then it is easy to see that 
C;, = 1 and C2 = 2, and not hard to see that C3 = 5. Notice that any rooted binary tree 
on at least one vertex can be viewed as two (possibly empty) binary trees joined into a 
new tree by introducing a new root vertex and making the children of this root the two 
roots of the original trees; see figure 3.5.2. (To make the empty tree a child of the new 
vertex, simply do nothing, that is, omit the corresponding child.) 

Thus, to make all possible binary trees with n vertices, we start with a root vertex, and 
then for its two children insert rooted binary trees on k and | vertices, with k +1 = n-—1, 
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NC 


Figure 3.5.2 Producing a new tree from smaller trees. 


for all possible choices of the smaller trees. Now we can write 
n—1 
Cn =D) CiCn—i-1- 
i=0 


For example, since we know that Cp = C, = 1 and Cp» = 2, 
C3 = CoCo + CyCy, + CoCo = 1-24+1-142-1=5, 


as mentioned above. Once we know the trees on 0, 1, and 2 vertices, we can combine them 
in all possible ways to list the trees on 3 vertices, as shown in figure 3.5.3. Note that the 
first two trees have no left child, since the only tree on 0 vertices is empty, and likewise 
the last two have no right child. 


AOS 4 


Figure 3.5.3 The 3-vertex binary rooted trees. 


Now we use a generating function to find a formula for C,,. Let f = S>7°, Cz. 
Now consider f?: the coefficient of the term x” in the expansion of f? is ar CiCn_-i, 
corresponding to all possible ways to multiply terms of f to get an x” term: 


CeCe Cae © ae Go eC ase” Oe oe 
Now we recognize this as precisely the sum that gives C;,41, so f? = po Cn4i2". If we 


multiply this by x and add 1 (which is Co) we get exactly f again, that is, ¢f?+1 =f or 
xf? — f +1 =0; here 0 is the zero function, that is, xf? — f +1 is 0 for all x. Using the 
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quadratic formula, 


1l+VJ/1-—42 
f= a 

Ms 
as long as x # 0. It is not hard to see that as x approaches 0, 


1+ 1-42 
20 


goes to infinity while 
l= V1l=4 
22 
goes to 1. Since we know f(0) = Co = 1, this is the f we want. 
Now by Newton’s Binomial Theorem 3.1.1, we can expand 


V1 —4e = (1+ (—42))/? = Ss ee (—42)". 
n=0 
Then 
diab 4g ->- (PV 4ynyn-} =-3(pcone 


Expanding the binomial coefficient ek ne ) and reorganizing the expression, we discover that 


cra malne)O™ = aan) 


In exercise 7 in section 1.2, we saw that the number of properly matched sequences of 


parentheses of length 2n is Co — Ga, and called this Cy. It is not difficult to see that 


7") 7 (oi) 7 oat) 


so the formulas are in agreement. 


Temporarily let A, be the number of properly matched sequences of parentheses of 
length 2n, so from the exercise we know A, = in — (° Ay It is possible to see directly 
that Ag = A; = 1 and that the numbers 4A,, satisfy the same recurrence relation as do the 
Cy, which implies that A, = C;,, without manipulating the generating function. 

There are many counting problems whose answers turns out to be the Catalan numbers. 
Enumerative Combinatorics: Volume 2, by Richard Stanley, contains a large number of 


examples. 
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2n _ 2n —_ il 2n 
n nt+1}/ nt+i1\n/]' 
2. Find a simple expression f(n) so that Cy41 = f(n)Cy. Use this to compute C1,..., C6 from 


Co. 


3. Show that if A, is the number of properly matched sequences of parentheses of length 2n, 
then 


Exercises 3.5. 


1. Show that 


n-1 
An = d, ee: eee 


Do this in the same style that we used for the number of rooted binary trees: Given all the 
sequences of shorter length, explain how to combine them to produce the sequences of length 
2n, in such a way that the sum clearly counts the number of sequences. Hint: Prove the 
following lemma: If s is a properly matched sequence of parentheses of length 2n, s may 
be written uniquely in the form (s1)s2, where s; and s2 are properly matched sequences of 
parentheses whose lengths add to 2n—2. For example, (())() = ({Q])[O] and ((Q) = ({))[(O)I, 
with the sequences s1 and s2 indicated by []. Note that si and s2 are allowed to be empty 
sequences, with length 0. 


4. Consider a “staircase” as shown below. A path from A to B consists of a sequence of edges 
starting at A, ending at B, and proceeding only up or right; all paths are of length 6. One 
such path is indicated by arrows. The staircase shown is a “3 x 3” staircase. How many 
paths are there in an n X n staircase? 


B 


A 


5. A convex polygon with n > 3 sides can be divided into triangles by inserting n — 3 non- 
intersecting diagonals. In how many different ways can this be done? The possibilities for 
nm = 5 are shown. 


DINER Er, 


6. A partition of a set S is a collection of non-empty subsets A; C S,1<i<k (the parts of 
the partition), such that Ghar A; = S and for every 1 4 j, Ai M A; = @. For example, one 
partition of {1,2,3,4,5} is {{1,3}, {4}, {2, 5}}. 

Suppose the integers 1,2,...,m are arranged on a circle, in order around the circle. A 
partition of {1,2,...,n} is a non-crossing partition if it satisfies this additional property: 
If w and x are in some part A;, and y and z are in a different part A;, then the line joining 
w to x does not cross the line joining y to z. The partition above, {1,3}, {4}, {2,5}, is not 
a non-crossing partition, as the the line 1-3 crosses the line 2-5. 
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Find the number of non-crossing partitions of {1,2,...,n}. 
Recall from section 1.4 that the Bell numbers count all of the partitions of {1,2,...,n}. 
Hence, this exercise gives us a lower bound on the total number of partitions. 


7. Consider a set of 2n people sitting around a table. In how many ways can we arrange for 
each person to shake hands with another person at the table such that no two handshakes 


cross? 


A 


Systems of Distinct Representatives 


Suppose that the student clubs at a college each send a representative to the student 
government from among the members of the club. No person may represent more than 
one club; is this possible? It is certainly possible sometimes, for example when no student 
belongs to two clubs. It is not hard to see that it could be impossible. So the first 
substantive question is: is there anything useful or interesting we can say about under 
what conditions it is possible to choose such representatives. 

We turn this into a more mathematical situation: 


DEFINITION 4.0.1 Suppose that A,, A2,..., An are sets, which we refer to as a set 
system. A (complete) system of distinct representatives is a sequence {21,%2,...%} 
such that x; € A; for all 7, and no two of the x; are the same. A (partial) system of distinct 
representatives is a sequence of distinct elements {x1,%2,...2,} such that x; € A;,, where 


Ji; J2,--+;Jk are distinct integers in [n]. 


In standard usage, “system of distinct representatives” means “complete system of dis- 
tinct representatives”, but it will be convenient to let “system of distinct representatives” 
mean either a complete or partial system of distinct representatives depending on context. 
We usually abbreviate “system of distinct representatives” as SDR. 

We will analyze this problem in two ways, combinatorially and using graph theory. 
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4.1 EXISTENCE OF SDRS 


In this section, SDR means complete SDR. It is easy to see that not every collection of sets 
has an SDR. For example, 


Ay _ {a, b}, Ao = 1G; b}, A3 — {a, b}. 


The problem is clear: there are only two possible representatives, so a set of three distinct 
representatives cannot be found. This example is a bit more general than it may at first 
appear. Consider 


A, = {a,b}, Ao = {a,b}, As = {a,b}, As = {b, c,d, e}. 


Now the total number of possible representatives is 5, and we only need 4. Nevertheless, 

this is impossible, because the first three sets have no SDR considered by themselves. Thus 

the following condition, called Hall’s Condition, is clearly necessary for the existence of 

an SDR: For every k > 1, and every set {i1,7%2,...,7n} © [n], am A;,| =k. That is, the 

number of possible representatives in any collection of sets must be at least as large as the 

number of sets. Both examples fail to have this property because |A; U Az U A3| = 2 < 3. 
Remarkably, this condition is both necessary and sufficient. 


THEOREM 4.1.1 Hall’s Theorem A collection of sets A, A2,..., An has an SDR 
if and only if for every k > 1, and every set {71,72,...,7%} C [nl, | Gia A;,| = k. 


Proof. We already know the condition is necessary, so we prove sufficiency by induction 
on n. 

Suppose n = 1; the condition is simply that |Ai| > 1. If this is true then A, is 
non-empty and so there is an SDR. This establishes the base case. 

Now suppose that the theorem is true for a collection of k < n sets, and suppose we 
have sets A,, Ag,..., An satisfying Hall’s Condition. We need to show there is an SDR. 

Suppose first that for every k < n and every {i1,%2,...,ix} C [n], that | Ws A;,| > 
k +1, that is, that these unions are larger than required. Pick any element xz, € An, 
and define B; = A;\{xr,} for each i < n. Consider the collection of sets B,,...,Bn—1, 
and any union Wise B;, of a subcollection of the sets. There are two possibilities: either 
Ui By = Ay or LU Ba = Uy Ag fon) Bothat i Bil = [a Ag lor 
| Ui Bi,| =| Oe A;,|—1. In either case, since | Wes A;,| > k-+1, | Wien B,,| = k. Thus, 
by the induction hypothesis, the collection By,...,B,-1 has an SDR {x1,%2,...,%n_1}, 
and for every i <n, 4; # Xn, by the definition of the B;. Thus {x1, 22,...,2%,} is an SDR 
for A, Ao,...,An- 

If it is not true that for every k < n and every {71,72,...,7x} C [nl], | eae A;,| = k+1, 
then for some k < n and {i1,72,...,ix}, oe A;,| = k. Without loss of generality, we 
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may assume that | Ona A,| =k. By the induction hypothesis, A,, Ag,..., Ax has an SDR, 
{bi cceens 

Define B; = A;\ hae A; for i > k. Suppose that {x,41,...,2%n} is an SDR for 
Brii,---, Bn; then it is also an SDR for Ay4i1,...,An. Moreover, {%1,...,2n} is an SDR 
for A,,..., An. Thus, to finish the proof it suffices to show that B,41,..., By, has an SDR. 
The number of sets here is n — k < n, so we need only show that the sets satisfy Hall’s 
Condition. 
B 


So consider some sets B ig)+--, Di,- First we notice that 


a1) 
|A, UAgU---UA, UB, UB, U---B, | =k +|Bi, U Bi, U--- Bi,|. 
Also 
|A; UAgU---U A, U By, U By, U--+ By, | = |A1 U Ag U---U Ag U Aj, U AG, U- +> Ai, | 


and 
|A; UAgU---UA,UA;, UA; U-++ Az) 2 A+. 


Putting these together gives 


k+ |B, UB, U---UB,|>k+1 
Be UB, Ue Bel Si 


Thus, By41,..., Bn has an SDR, which finishes the proof. = 


Exercises 4.1. 


1. How many different systems of distinct representatives are there for Ai = {1,2}, A2 = {2,3}, 
inp A= (iil? 

2. How many different systems of distinct representatives are there for the sets A; = [n]\%, 
i=1,2,...,n,n > 2? 

3. Suppose the set system Aj, Ao,..., An has an SDR, and that x € A;. Show the set system 
has an SDR containing x. Show that x cannot necessarily be chosen to represent Aj. 

4. Suppose the set system Aj, A2,..., An satisfies os, Ai,| => k+1 for every 1 <k <n and 
{i1,%2,..., tn} C [n], and that « € A;. Show the set system has an SDR in which x represents 
Aj. 

5. An m xn chessboard, with m even and both m and n at least 2, has one white and one black 
square removed. Show that the board can be covered by dominoes. 
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4.2 ParRTIAL SDRs 


In this section, SDR means partial SDR. 
If there is no complete SDR, we naturally want to know how many of the n sets can 
be represented, that is, what is the largest value of m so that some m of the sets have 


a complete SDR. Since there is no complete SDR, there are sets A;,,Aj;,,...,A;, such 
that Cea A;,| =1< k. Clearly at most / of these k sets have a complete SDR, so no 
SDR for Aj, Ag,...,An can be larger than n —k +1. Thus, m can be no larger than the 
minimum value, over all & and all collections of sets A;,, Aj,,..., Ai,, of n—k+| Use A,,|. 


Note that if US Ag | > kh, ak + Wes A;,| >, which tells us nothing. If k = 0, 
n—-k+| es A;,| =n (because empty unions are empty), so we are guaranteed that the 
minimum is never greater than n. In fact the minimum value of the expression is exactly 


the size of a largest SDR. 


THEOREM 4.2.1 The maximum size of an SDR for the sets A;, Ag,...,A, is the 
minimum value, for 0 < k <n and sets Aj,, Ai,,...,Ai,, of n-—k+| = Aj, |. 


Proof. Since no SDR can be larger than this minimum value, it suffices to show that we 
can find an SDR whose size is this minimum. The proof is by induction on n; the case 
n = | is easy. 

Suppose first that the minimum value is n, so that for all k and all collections of sets 


Aga Aas; .: a Ags 
k 
n-k+|\J Ai| 22. 
j=l 
Then rearranging we see that 
k 
| U A;,| = k, 
j=l 


so by Hall’s Theorem (4.1.1), there is an SDR of size n. 
Note that the minimum value of n — k + | ey A;,| occurs when | C= A;,|—kis a 


minimum, that is 


k 
min(n —k + | (J Ai,|) = 2+ min(| J Ai,|— 4). 


j=l j=l 


Suppose now that the minimum m is less than n, and that m = n—k+ ue A;,|, 
withO <k <n. Let B; = Aj,; since k < n, the induction hypothesis applies to the 
sets B,,...,By. Since each set B; is A;,, (en By,| — 1S oa A;,| —k, for all 1 and 


Bp,,.--,Bn,. Thus, the minimum value of an B;,| — 1, over all 1 and By,,..., Bn,, is 
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| a B= k= | Ue A;,|— k, so by the induction hypothesis, the sets A;,,Aj,,...,A 
have an SDR of size k —k + | oan A;,| =| Ca Ay, | = tt hy {Pignnts ines 

Now consider the n — k sets consisting of those original sets not in A;,, Aj,,...,Ai,, 
that is, {A; | i ¢ {t1,...,7,}}. Let C; = Aj\ les A;, for i not in 41,%2,...,i%. Consider 
some sets Cy,,Cy,.,.--,C,. If | rs Cy,| < J then Us Cy,;| -—1<0 and 


th 


k k 
n—-k+||JAi|>n—-k-14+/JC,,)+1 LU Aa! 


j=l j=l j=l 
>n—(k+l)+|Cy, U---UCy, UAi, U---U Aj, | 
=n—(k+l)+|Ag, U---UAg, UA;, U-+-U Ai, |, 


l 


contradicting the fact that n—k+| Cs A;,|isaminimum. Thus by Hall’s Theorem (4.1.1), 
the sets Cy,,Cg,,.--,Cg,_, have a complete SDR {yi,---,Yn—zx}. By the definition of the 
sets Cp 4 Piss aes Wimew et (110i. ne Sy 0 Aig enki ney 8 
an SDR of sizem—n+k+n—k=mas desired. 


Finally, suppose that the minimum value of n—k+ *__ A;.| occurs only when k = n, 
gale G 


n—ntl|JAjl=1U Ail 
j=l j=l 


so we want an SDR of size 


Then 
n—(n—1)+ A;| > LJ 4; 


1+ A;| > U4, 


A;| > U Ail 


Since Wee Aj] < |Uj21 Ail, ena Aj| = |Uj_1 A;|. By the induction hypothesis, the 
theorem applies to the sets Ai, Ao,..., An—1. If the minimum of (n — 1) —1+| C= A;,| 
occurs when / = n — 1, then there is an SDR of size (n — 1) — (n-—1) + (Os Ay|\= 
| Dizi Aj| =|Uj_, Ajl, as desired. 

If the minimum occurs when / < n — 1 and not when / = n — 1, then 


L n-1 
(n—1)-1+|)J Ail <| U Ail 
j=l j=l 
n-1 


l 
n—1+ ||) Agl<| LU) Al +1 


soi j=l 
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and by assumption 
l n 
n—-1+|(J 4,1 >1U Ail 
j=l j=l 
Thus 


n I 
[UJ Ail<n-t4+|U Ay 
j=l j=1 


This means that there is an integer strictly between two consecutive integers, a contradic- 
tion. This completes the proof. = 


While this theorem provides a method to calculate the size of a maximum SDR, the 
method is hardly efficient: it requires looking at all possible collections of the sets. It also 
does not provide a way to find an actual SDR, that is, the actual representatives. We will 
fix these problems in the last two sections of this chapter. 


Exercises 4.2. 


1. Find the size of a maximum SDR for 
A, = {a,b,c}, A2 = {a,b,c,d,e}, A3 = {a,b}, As = {b,c}, As = {a}, Ac = {a,c, e}. 


Justify your answer. 


4.3 LATIN SQUARES 


DEFINITION 4.3.1 A Latin square of order n is an n x n grid filled with n symbols 
so that each symbol appears once in each row and column. 


EXAMPLE 4.3.2 Here is a Latin square of order 4: 


0 | & | a | o 
& a1 O 19% 
4/0 /|9 | & 
> |O | d | a 


Usually we use the integers 1...n for the symbols. There are many, many Latin 
squares of order n, so it pays to limit the number by agreeing not to count Latin squares 
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that are “really the same” as different. The simplest way to do this is to consider reduced 
Latin squares. A reduced Latin square is one in which the first row is 1... (in order) 
and the first column is likewise 1...n. 


EXAMPLE 4.3.3 Consider this Latin square: 


4 


2 
4 
3 
1 


3 | 1 
2 1} 3 
1 4 | 2 
3 2|4 


The order of the rows and columns is not really important to the idea of a Latin square. 
If we reorder the rows and columns, we can consider the result to be in essence the same 
Latin square. By reordering the columns, we can turn the square above into this: 


Le) 2s) 3s) 4 
o> | a |) Be 
ales al (aE 
4)1/2]38 
Then we can swap rows two and three: 
cn ea am? 
yaaa es al (ne? a 
OAs de 2 
As I) | 4 oe 


This Latin square is in reduced form, and is essentially the same as the original. 


Another simple way to change the appearance of a Latin square without changing its 
essential structure is to interchange the symbols. 


EXAMPLE 4.3.4 Starting with the same Latin square as before: 


4 


Zils lt al 
2);4/ 1) 3 
1/3 ]4 4] 2 
3 | 1/2] 4 


we can interchange the symbols 1 and 4 to get: 
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Oe oO) 
eye]; w] se 


ee 
2|1 
4 | 3 
3.) 4 


Now if we swap rows three and four we get: 


3} 4 
4 | 3 
ae 
1 | 2 


wy ee] rR | be 


1 
2 
3 
4 


Notice that this Latin square is in reduced form, but it is not the same as the reduced 
form from the previous example, even though we started with the same Latin square. Thus, 


we may want to consider some reduced Latin squares to be the same as each other. 


DEFINITION 4.3.5 Two Latin squares are isotopic if each can be turned into the 
other by permuting the rows, columns, and symbols. This isotopy relation is an equivalence 


relation; the equivalence classes are the isotopy classes. 


Latin squares are apparently quite difficult to count without substantial computing 
power. According to Wikipedia, the number of Latin squares is known only up to n = 11. 
Here are the first few values for all Latin squares, reduced Latin squares, and non-isotopic 
Latin squares (that is, the number of isotopy classes): 


576 
161280 


How can we produce a Latin square? If you know what a group is, you should know 
that the multiplication table of any finite group is a Latin square. (Also, any Latin square 
is the multiplication table of a quasigroup.) Even if you have not encountered groups by 
that name, you may know of some. For example, considering the integers modulo n under 
addition, the addition table is a Latin square. 


EXAMPLE 4.3.6 Here is the addition table for the integers modulo 6: 
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Re | OO] ory | Ww] bh 


wpe | oO] ou; e® | w 


Wilmer ;lo]youy ss 


eo Oe 
Oyo; ek] wlywpo]ye 
em l_wlrme le | oO] o 


EXAMPLE 4.3.7 Here is another way to potentially generate many Latin squares. 


Start with first row 1,...,n. Consider the sets A; = [n|\{i}. From exercise 1 in section 4.1 
we know that this set system has many SDRs; if 71, %2,...,2%, is an SDR, we may use it for 
row two. In general, after we have chosen rows 1,...,7, we let A; be the set of integers 


that have not yet been chosen for column 7. This set system has an SDR, which we use for 


row 7 +1. 


DEFINITION 4.3.8 Suppose A and B are two Latin squares of order n, with entries 
A;,; and B,,; in row i and column j. Form the matrix M with entries M;,; = (Ai,;, Bi,;); 
we will denote this operation as M = AU B. We say that A and B are orthogonal if 
M contains all n? ordered pairs (a,b), 1 <a<n,1< 06 <n, that is, all elements of 
ae en et ee eee 


As we will see, it is easy to find orthogonal Latin squares of order n if n is odd; not 
too hard to find orthogonal Latin squares of order 4k, and difficult but possible to find 
orthogonal Latin squares of order 4k+2, with the exception of orders 2 and 6. In the 1700s, 
Euler showed that there are orthogonal Latin squares of all orders except of order 4k + 2, 
and he conjectured that there are no orthogonal Latin squares of of order 6. In 1901, the 
amateur mathematician Gaston Tarry showed that indeed there are none of order 6, by 
showing that all possibilities for such Latin squares failed to be orthogonal. In 1959 it was 
finally shown that there are orthogonal Latin squares of all other orders. 


THEOREM 4.3.9 There are pairs of orthogonal Latin squares of order n when n is 
odd. 


Proof. This proof can be shortened by using ideas of group theory, but we will present 
a self-contained version. Consider the addition table for addition mod n: 
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We claim first that this (without the first row and column, of course) is a Latin square with 
symbols 0,1,...,2 — 1. Consider two entries in row i, sayi+j andi+k. Ifi+j=i+ 9 
(mod n), then j = k, so j = k. Thus, all entries of row 7 are distinct, so each of 0,1,...,n—1 
appears exactly once in row 7. The proof that each appears once in any column is similar. 
Call this Latin square A. (Note that so far everything is true whether n is odd or even.) 

Now form a new square B with entries B;,; = Agi; = 2i+ 7, where by 22 and 27+ j 
we mean those values mod n. Thus row i of B is the same as row 2i of A. Now we claim 
that in fact the rows of B are exactly the rows of A, in a different order. To do this, it 
suffices to show that if 21 = 2k (mod n), then i = k. This implies that all the rows of B 
are distinct, and hence must be all the rows of A. 

Suppose without loss of generality that i > k. If 22 = 2k (mod n) then n | 2(i — k). 
Since n is odd, n | (tk). Since i and & are in 0,1,...,.n-1,0<i—k<n-1. Of these 
values, only 0 is divisible by n, soi — k = 0. Thus B is also a Latin square. 

To show that AU B contains all n? elements of {0,1,...,2—1} x {0,1,...,n—1}, it 
suffices to show that no two elements of AUB are the same. Suppose that (71+), 27; +71) = 
(t2 + Jo, 2i2 + jz) (arithmetic is mod n). Then by subtracting equations, 7; = 72; with the 
first equation this implies 7; = jo. rT] 


EXAMPLE 4.3.10 Whenn= 


>) 


1 2 (0,0) 
0 1] =| (1,2) 
20 (2,1) 


(1, 1) 
(2, 0) 
(0, 2) 


(2, 2) 
(0, 1) 
1,0 


3 
0 
2 
1 (1,0) 


Oe de 2 
1. 2. 0) U 
2900 oh 


One obvious approach to constructing Latin squares, and pairs of orthogonal Latin 
squares, is to start with smaller Latin squares and use them to produce larger ones. We 
will produce a Latin square of order mn from a Latin square of order m and one of order 
n. 

Let A be a Latin square of order m with symbols 1,...,m, and B one of order n with 
symbols 1,...,n. Let ¢,;, 1<i<m,1<j <n, be mn new symbols. Form an mn x mn 


grid by replacing each entry of B with a copy of A. Then replace each entry 7 in this copy 


4.3 Latin Squares 81 


of A with c;,;, where j is the entry of B that was replaced. We denote this new Latin 
square A x B. Here is an example, combining a 4 x 4 Latin square with a 3 x 3 Latin 


square to form a 12 x 12 Latin square: 


1}2)3/4 11213 
zac ec ea ese 
3/4/1)2 3 9 
4/1}2)3 


THEOREM 4.3.11 If A and B are Latin squares, so is A x B. 


Proof. Consider two symbols c;,; and c,, in the same row. If the positions containing 
these symbols are in the same copy of A, then 7 4 k, since A is a Latin square, and so 
the symbols c;,; and cz, are distinct. Otherwise, 7 € 1, since B is a Latin square. The 


argument is the same for columns. = 


Remarkably, this operation preserves orthogonality: 


THEOREM 4.3.12 If A, and Ag are Latin squares of order m, B, and Bg are Latin 
squares of order n, A, and Ag are orthogonal, and B, and Bz are orthogonal, then A, x By; 
is orthogonal to A; x Bo. 


Proof. We denote the contents of A; x B; by C;(w,2z,y,z), meaning the entry in row 
w and column zx of the copy of A; that replaced the entry in row y and column z of B;, 
which we denote B;(y,z). We use A;(w, x) to denote the entry in row w and column z of 
Aj. 

Suppose that (Ci(w, 2, y, z), Co(w, x, y, z)) = (Ci(w’, 2’, y’, 2’), Co(w’, x’, y’, 2’)), where 
(w,2,y,z) 4 (w’,2’,y’,z’). Either (w,2) ~ (w’,x’) or (y,z) # (y’,2’). If the lat- 
ter, then (B,(y,z), Ba(y,z)) = (Bi(y’, 2’), Bo(y’, 2’)), a contradiction, since By, is or- 
thogonal to Bj. Hence (y,z) = (y’,2’) and (w,x) 4 (w’,2’). But this implies that 
(Ai(w, x), Ao(w,x)) = (Ai(w’, 2’), Ao(w’, x’)), a contradiction. Hence A; x By is orthog- 
onal to A, x Bo. a 
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We want to construct orthogonal Latin squares of order 4k. Write 4k = 2™”-n, 
where n is odd and m > 2. We know there are orthogonal Latin squares of order n, by 
theorem 4.3.9. If there are orthogonal Latin squares of order 2”, then by theorem 4.3.12 
we can construct orthogonal Latin squares of order 44 = 2™-n. 

To get a Latin square of order 2”, we also use theorem 4.3.12. It suffices to find 
two orthogonal Latin squares of order 4 = 2? and two of order 8 = 23. Then repeated 
application of theorem 4.3.12 allows us to build orthogonal Latin squares of order 2”, 
m > 2. 

Two orthogonal Latin squares of order 4: 


12 3 4 12 3 4 

2 1 4 3 3.4 1 2 

3 4 1 2 4 3 2 1]? 

4 3 2 1 2 1 4 3 

and two of order 8: 

13 45 67 8 2 145 6 7 8 2 3 
5 2 7 18 4 6 3 8 265 3 14 7 
6 4.3 8.1 2 5 7 28 3 7 6 4 1 5 
7 8 5 42 1 3 6 3.6 2 4 °8 7 -5 1 
8 7 26 5 3 1 4 4173 52 8 6 
2 5 8 38 7 6 4 1 5 F 18 4 6 3 2 
ol 6 24 8 7 5 6 3 8 1 25 7 4 
4617 3 5 2 8 ' 5 42 1 3.6 8 


Exercises 4.3. 


1. Show that there is only one reduced Latin square of order 3. 
2. Verify that the isotopy relation is an equivalence relation. 


3. Find all 4 reduced Latin squares of order 4. Show that there are at most 2 isotopy classes 
for order 4. 

4. Show that the second set system defined in example 4.3.7 has an SDR as claimed. 

5. Show that there are no orthogonal Latin squares of order 2. 

6. Find the two orthogonal Latin squares of order 5 as described in theorem 4.3.9. Show your 
answer as in example 4.3.10. 

7. Prove that to construct orthogonal Latin squares of order 2”, m > 2, it suffices to find two 
orthogonal Latin squares of order 4 = 2? and two of order 8 = 2?. 

8. Ann xn Latin square A is symmetric if it is symmetric around the main diagonal, that is, 
Aj; = Aj,; for all 2 and j. It is easy to find symmetric Latin squares: every addition table 
modulo n is an example, as in example 4.3.6. A Latin square is idempotent if every symbol 
appears on the main diagonal. Show that if A is both symmetric and idempotent, then n is 
odd. Find a 5 x 5 symmetric, idempotent Latin square. 
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9. The transpose A! of a Latin square A is the reflection of A across the main diagonal, so 
that Aj; = A;,;. A Latin square is self-orthogonal if A is orthogonal to A'. Show that there 
is no self-orthogonal Latin square of order 3. Find one of order 4. 


4.4 INTRODUCTION TO GRAPH THEORY 


We can interpret the SDR problem as a problem about graphs. Given sets A;, A2,..., An, 
with Us_, Ai = {11,%2,...,U%m}, we define a graph with n +m vertices as follows: The 
vertices are labeled {.Aj, Ag,..., An, £1, £2,.-.£Lm}, and the edges are {{A;,x;} | x; € Aj}. 


EXAMPLE 4.4.1 Let A; = {a,b,c,d}, Ao = {a,c,e,g}, Az = {b,d}, and A, = {a, f,g}. 
The corresponding graph is shown in figure 4.4.1. 


Aj Ao A3 Ay 


a b c d e€ 7 g 


Figure 4.4.1 <A set system depicted as a bipartite graph. 


Before exploring this idea, we introduce a few basic concepts about graphs. If two 
vertices in a graph are connected by an edge, we say the vertices are adjacent. If a vertex v 
is an endpoint of edge e, we say they are incident. The set of vertices adjacent to v is called 
the neighborhood of v, denoted N(v). This is sometimes called the open neighborhood 
of v to distinguish it from the closed neighborhood of v, N|[v] = N(v)U{v}. The degree 
of a vertex v is the number of edges incident with v; it is denoted d(v). 

Some simple types of graph come up often: A path is a graph P,, on vertices v1, v2,..-, Un, 
with edges {v;,v;41} for 1 < i < n—1, and no other edges. A cycle is a graph C, on 
vertices U1, U2,...,Un with edges {v;,01+(4 mod n)} for 1 < i <n, and no other edges; this 
is a path in which the first and last vertices have been joined by an edge. (Generally, we 
require that a cycle have at least three vertices. If it has two, then the two are joined by 
two distinct edges; when a graph has more than one edge with the same endpoints it is 
called a multigraph. If a cycle has one vertex, there is an edge, called a loop, in which 
a single vertex serves as both endpoints.) The length of a path or cycle is the number of 
edges in the graph. For example, P; has length 0, C; has length 1. A complete graph 
K,, is a graph on v4, v2,...,Un in which every two distinct vertices are joined by an edge. 
See figure 4.4.2 for examples. 

The graph in figure 4.4.1 is a bipartite graph. 
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AC 8 


Figure 4.4.2 Graphs Ps, Ce, Ks. 


DEFINITION 4.4.2 A graph G is bipartite if its vertices can be partitioned into two 
parts, say {v1,v2,...,Un} and {w1, w2,...,Wm} so that all edges join some v; to some w;; 


no two vertices v; and v; are adjacent, nor are any vertices w; and w;. 


The graph in figure 4.4.1 is bipartite, as are the first two graphs in figure 4.4.2. 


4.5 MATCHINGS 


Now we return to systems of distinct representatives. 

A system of distinct representatives corresponds to a set of edges in the corresponding 
bipartite graph that share no endpoints; such a collection of edges (in any graph, not just 
a bipartite graph) is called a matching. In figure 4.5.1, a matching is shown in red. This 
is a largest possible matching, since it contains edges incident with all four of the top 


vertices, and it thus corresponds to a complete SDR. 


Aj Ao A3 Ay 


a b Cc d e€ ti g 


Figure 4.5.1 <A set system depicted as a bipartite graph. 


Any bipartite graph can be interpreted as a set system: we simply label all the vertices 
in one part with “set names” A, Ag, etc., and the other part is labeled with “element 
names”, and the sets are defined in the obvious way: A; is the neighborhood of the vertex 
labeled “A;”. Thus, we know one way to compute the size of a maximum matching, namely, 
we interpret the bipartite graph as a set system and compute the size of a maximum SDR; 


this is the size of a maximum matching. 
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We will see another way to do this working directly with the graph. There are two 
advantages to this: it will turn out to be more efficient, and as a by-product it will actually 
find a maximum matching. 

Given a bipartite graph, it is easy to find a maximal matching, that is, one that cannot 
be made larger simply by adding an edge: just choose edges that do not share endpoints 
until this is no longer possible. See figure 4.5.2 for an example. 


Figure 4.5.2 A maximal matching is shown in red. 


An obvious approach is then to attempt to make the matching larger. There is a 
simple way to do this, if it works: We look for an alternating chain, defined as follows. 


DEFINITION 4.5.1 Suppose M isa matching, and suppose that v1, w1, v2, W2,..., UR, Wk 
is a sequence of vertices such that no edge in M is incident with v, or wz, and moreover 
for all 1 <i <k, uv; and w; are joined by an edge not in M, and for all 1 <i< k-—1, 
w; and v;41 are joined by an edge in M. Then the sequence of vertices together with the 


edges joining them in order is an alternating chain. 


The graph in figure 4.5.2 contains alternating chains, one of which is shown in fig- 
ure 4.5.3. 


Figure 4.5.3 An alternating chain. 


Suppose now that we remove from M all the edges that are in the alternating chain 
and also in M, forming M’, and add to M’ all of the edges in the alternating chain not in 
M, forming M”. It is not hard to show that M” is a matching, and it contains one more 
edge than M. See figure 4.5.4. 

Remarkably, if there is no alternating chain, then the matching M is a maximum 


matching. 


THEOREM 4.5.2 Suppose that M is a matching in a bipartite graph G, and there is 
no alternating chain. Then M is a maximum matching. 
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Figure 4.5.4 A new, larger matching. 


Proof. We prove the contrapositive: Suppose that M is not a maximum matching. Then 
there is a larger matching, N. Create a new graph G’ by eliminating all edges that are in 
both M and N, and also all edges that are in neither. We are left with just those edges in 
M or N but not both. 

In this new graph no vertex is incident with more than two edges, since if v were 
incident with three edges, at least two of them would belong to M or two would belong to 
N, but that can’t be true since M and N are matchings. This means that G’ is composed 
of disjoint paths and cycles. Since N is larger than M, G’ contains more edges from N 
than from M, and therefore one of the paths starts and ends with an edge from N, and 
along the path the edges alternate between edges in N and edges in M. In the original 
graph G with matching M, this path forms an alternating chain. The “alternating” part 
is clear; we need to see that the first and last vertices in the path are not incident with 
any edge in M. 

Suppose that the first two vertices are vy and vg. Then v; and v2 are joined by an 
edge of N. Suppose that v1 is adjacent to a vertex w and that the edge between v, and w 
is in M. This edge cannot be in N, for then there would be two edges of N incident at v1. 
But then this edge is in G’, since it is in M but not N, and therefore the path in G’ does 
not start with the edge in N joining v, and v2. This contradiction shows that no edge of 
M is incident at v,. The proof that the last vertex in the path is likewise not incident with 
an edge of M is essentially identical. rT 


Now to find a maximum matching, we repeatedly look for alternating chains; when we 
cannot find one, we know we have a maximum matching. What we need now is an efficient 
algorithm for finding the alternating chain. 

The key, in a sense, is to look for all possible alternating chains simultaneously. Sup- 
pose we have a bipartite graph with vertex partition {v1, v2,...,Un} and {wy, we,...,Wm} 
and a matching M. The algorithm labels vertices in such a way that if it succeeds, the 
alternating chain is indicated by the labels. Here are the steps: 


0. Label with ‘(5,0)’ all vertices v; that are not incident with an edge in M. Set 
variable step to 0. 


Now repeat the next two steps until no vertex acquires a new label: 
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1. Increase step by 1. For each newly labeled vertex v;, label with (7, step) any 
unlabeled neighbor w,; of v; that is connected to v; by an edge that is not in M. 


2. Increase step by 1. For each newly labeled vertex w;, label with (i, step) any 
unlabeled neighbor v; of w; that is connected to w; by an edge in M. 


Here “newly labeled” means labeled at the previous step. When labeling vertices in 
step 1 or 2, no vertex is given more than one label. For example, in step 1, it may be that 
wr is a neighbor of the newly labeled vertices v; and v;. One of vj and v;, say v;, will be 
considered first, and will cause w; to be labeled; when v; is considered, wz; is no longer 
unlabeled. 

At the conclusion of the algorithm, if there is a labeled vertex w; that is not incident 
with any edge of M, then there is an alternating chain, and we say the algorithm succeeds. 
If there is no such w;, then there is no alternating chain, and we say the algorithm fails. 
The first of these claims is easy to see: Suppose vertex w; is labeled (k,,s). It became 
labeled due to vertex vz, labeled (k2,s — 1) that is connected by an edge not in M to w;. 
In turn, vz, is connected by an edge in M to vertex wz, labeled (kg, s — 2). Continuing in 
this way, we discover an alternating chain ending at some vertex v; labeled (5,0): since 
the second coordinate step decreases by 1 at each vertex along the chain, we cannot repeat 
vertices, and must eventually get to a vertex with step = 0. If we apply the algorithm to 
the graph in figure 4.5.2, we get the labeling shown in figure 4.5.5, which then identifies 
the alternating chain we, v3, wW1,v1. Note that as soon as a vertex w; that is incident with 
no edge of M is labeled, we may stop, as there must be an alternating chain starting at w;; 
we need not continue the algorithm until no more labeling is possible. In the example in 
figure 4.5.5, we could stop after step 3, when wz becomes labeled. Also, the step component 
of the labels is not really needed; it was included to make it easier to understand that. if 
the algorithm succeeds, there really is an alternating chain. 


(1,1) (Soo) (4,5) (3, 3) 
Wi W2 W3 W4 


UL (0D) U3 U4 
(S, 0) (3,6) (1, 2) (4, 4) 


Figure 4.5.5 Labeling of a bipartite graph with matching; wa, v3, wi, v1 is an alternating 
chain. 


To see that when the algorithm fails there is no alternating chain, we introduce a new 
concept. 


88 Chapter 4 Systems of Distinct Representatives 


DEFINITION 4.5.3 A vertex cover in a graph is a set of vertices S such that every 
edge in the graph has at least one endpoint in S. 


There is always a vertex cover of a graph, namely, the set of all the vertices of the 
graph. What is clearly more interesting is a smallest vertex cover, which is related to a 


maximum matching. 


THEOREM 4.5.4 If M isa matching in a graph and S is a vertex cover, then |M| < |S]. 


Proof. Suppose M is a matching S is a vertex cover. Since each edge of M has an 
endpoint in S, if |M| > |S| then some vertex in S is incident with two edges of M, a 
contradiction. Hence |M| < |S]. | 


Suppose that we have a matching M and vertex cover S for a graph, and that |M| = |S]. 
Then the theorem implies that M is a maximum matching and S is a minimum vertex 
cover. To show that when the algorithm fails there is no alternating chain, it is sufficient 
to show that there is a vertex cover that is the same size as M. Note that the proof of this 
theorem relies on the “official” version of the algorithm, that is, the algorithm continues 


until no new vertices are labeled. 


THEOREM 4.5.5 Suppose the algorithm fails on the bipartite graph G with matching 
M. Let L be the set of labeled w;, U the set of unlabeled v;, and S = LUU. Then S isa 
vertex cover and |M| = |S]. 


Proof. If S is not a cover, there is an edge {v;, w;} with neither v; nor w; in S, so v; is 
labeled and w, is not. If the edge is not in M, then the algorithm would have labeled w; 
at the step after v; became labeled, so the edge must be in M. Now v; cannot be labeled 
(S,0), so v; became labeled because it is connected to some labeled wz by an edge of M. 
But now the two edges {v;,w;} and {v;, wz} are in M, a contradiction. So S is a vertex 
cover. 

We know that || < |S], so it suffices to show |S| < |], which we can do by finding 
an injection from S to M. Suppose that w; € S, so w; is labeled. Since the algorithm 
failed, w; is incident with an edge e of M; let f(w;) =e. If vu; € S, v; is unlabeled; if vu; 
were not incident with any edge of M, then v; would be labeled (5,0), so vu; is incident with 
an edge e of M; let f(v;) = e. Since G is bipartite, it is not possible that f(w;) = f(w;) 
or f(u;) = f(v;). If f(wi) = f(v;), then w; and v; are joined by an edge of M, and the 
algorithm would have labeled v;. Hence, f is an injection. = 


We have now proved this theorem: 


THEOREM 4.5.6 Ina bipartite graph G, the size of a maximum matching is the same 


as the size of a minimum vertex cover. r | 
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It is clear that the size of a maximum SDR is the same as the size of a maximum 
matching in the associated bipartite graph G. It is not too difficult to see directly that 
the size of a minimum vertex cover in G is the minimum value of f(n,i1,i2,...,i~) = 
n—-k+ oe A;,|. Thus, if the size of a maximum matching is equal to the size of 
a minimum cover, then the size of a maximum SDR is equal to the minimum value of 
n—k+ oe A;,|, and conversely. More concisely, theorem 4.5.6 is true if and only if 
theorem 4.2.1 is true. 

More generally, in the schematic of figure 4.5.6, if any three of the relationships are 
known to be true, so is the fourth. In fact, we have proved all but the bottom equality, so 
we know it is true as well. 


max sdr =? max matching 
|? II? 
min f(Ry.s.) =? min cover 
Figure 4.5.6 If any three of the “=?” are “=”, so is the fourth. 


Finally, note that we now have both a more efficient way to compute the size of a 
maximum SDR and a way to find the actual representatives: convert the SDR problem to 
the graph problem, find a maximum matching, and interpret the matching as an SDR. 


Exercises 4.5. 


1. In this bipartite graph, find a maximum matching and a minimum vertex cover using the 
algorithm of this section. Start with the matching shown in red. Copies of this graph are 
available in this pdf file. 


V1 


2. Show directly that that the size of a minimum vertex cover in G is the minimum value of 
n—-k+| es Aj,|, as mentioned above. 


O 


Graph Theory 


5.1 THe BASICS 


See section 4.4 to review some basic terminology about graphs. 

A graph G consists of a pair (V, £), where V is the set of vertices and E the set of 
edges. We write V(G) for the vertices of G and E(G) for the edges of G when necessary 
to avoid ambiguity, as when more than one graph is under discussion. 

If no two edges have the same endpoints we say there are no multiple edges, and if 
no edge has a single vertex as both endpoints we say there are no loops. A graph with 
no loops and no multiple edges is a simple graph. A graph with no loops, but possibly 
with multiple edges is a multigraph. The condensation of a multigraph is the simple 
graph formed by eliminating multiple edges, that is, removing all but one of the edges with 
the same endpoints. To form the condensation of a graph, all loops are also removed. We 
sometimes refer to a graph as a general graph to emphasize that the graph may have 
loops or multiple edges. 

The edges of a simple graph can be represented as a set of two element sets; for 


example, 


({v1, siete U7}, {{v1, v2}, {v2, v3}, {v3, va}, {v3, 05) {v4, us}, {us, v6}, {v6, u7}h) 


is a graph that can be pictured as in figure 5.1.1. This graph is also a connected 
graph: each pair of vertices v, w is connected by a sequence of vertices and edges, 
U = V1, €1, V2, €2,---,Uk = W, where v; and v;+41 are the endpoints of edge e;. The graphs 
shown in figure 4.4.2 are connected, but the figure could be interpreted as a single graph 
that is not connected. 
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Figure 5.1.1 <A simple graph. 


A graph G = (V,£) that is not simple can be represented by using multisets: a loop 
is a multiset {v,v} = {2-v} and multiple edges are represented by making F a multiset. 
The condensation of a multigraph may be formed by interpreting the multiset FE as a set. 

A general graph that is not connected, has loops, and has multiple edges is shown in 
figure 5.1.2. The condensation of this graph is shown in figure 5.1.3. 


oN Oot 


Figure 5.1.2 <A general graph: it is not connected and has loops and mulitple edges. 


IN YX 


Figure 5.1.3 The condensation of the previous graph. 


The degree of a a vertex v, d(v), is the number of times it appears as an endpoint of 
an edge. If there are no loops, this is the same as the number of edges incident with v, but 
if v is both endpoints of an edge, namely, of a loop, then this contributes 2 to the degree of 
v. The degree sequence of a graph is a list of its degrees; the order does not matter, but 
usually we list the degrees in increasing or decreasing order. The degree sequence of the 
graph in figure 5.1.2, listed clockwise starting at the upper left, is 0,4, 2,3, 2,8, 2, 4, 3, 2, 2. 
We typically denote the degrees of the vertices of a graph by d;, 7 = 1,2,...,, where n is 
the number of vertices. Depending on context, the subscript 7 may match the subscript on 
a vertex, so that d; is the degree of v;, or the subscript may indicate the position of d; in 
an increasing or decreasing list of the degrees; for example, we may state that the degree 
sequence is dy < dg <---< dy. 

Our first result, simple but useful, concerns the degree sequence. 
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THEOREM 5.1.1 _ In any graph, the sum of the degree sequence is equal to twice the 


number of edges, that is, 
nm 


S/d; = 2\E|. 


i=1 


Proof. Let d; be the degree of v;. The degree d; counts the number of times v; appears 
as an endpoint of an edge. Since each edge has two endpoints, the sum }>""_, d; counts 
each edge twice. rT 


An easy consequence of this theorem: 


COROLLARY 5.1.2. The number of odd numbers in a degree sequence is even. = 


An interesting question immediately arises: given a finite sequence of integers, is it 
the degree sequence of a graph? Clearly, if the sum of the sequence is odd, the answer is 
no. If the sum is even, it is not too hard to see that the answer is yes, provided we allow 
loops and multiple edges. The sequence need not be the degree sequence of a simple graph; 
for example, it is not hard to see that no simple graph has degree sequence 0,1,2,3,4. A 
sequence that is the degree sequence of a simple graph is said to be graphical. Graphical 
sequences have been characterized; the most well known characterization is given by this 
result: 


THEOREM 5.1.3 A sequence d; > dy >... > dy, is graphical if and only if ie d; is 
even and for all k € {1,2,...,n}, 


k n 
Sod; <k(k-1)+ S> min(d;,k). 
i=1 i=k4+1 


It is not hard to see that if a sequence is graphical it has the property in the theorem; 
it is rather more difficult to see that any sequence with the property is graphical. 
What does it mean for two graphs to be the same? Consider these three graphs: 


Gy — ({v1, U2, U3; vat, {{v1, vo}, {vVa, v3}, {U3, va}, {V2, va}}) 

Go = ({v1, V2, U3, va}, {{v1, vo}, {v1, vs}, {v3, va}, {v2, va}}) 

G3 = ({w1, W2, W3; wa}, {{w1, we}, {wi, wa}, {w3, wa}, {wa, wa}}) 
These are pictured in figure 5.1.4. Simply looking at the lists of vertices and edges, they 
don’t appear to be the same. Looking more closely, Gz and G3 are the same except for 


the names used for the vertices: v; in one case, w; in the other. Looking at the pictures, 
there is an obvious sense in which all three are the same: each is a triangle with an edge 
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(and vertex) dangling from one of the three vertices. Although G, and G2 use the same 
names for the vertices, they apply to different vertices in the graph: in G, the “dangling” 
vertex (officially called a pendant vertex) is called v1, while in G2 it is called v3. Finally, 
note that in the figure, Gz and G3 look different, even though they are clearly the same 
based on the vertex and edge lists. 


U1 v2 V1 V2 W1 W3 


U3 U4 U3 U4 W2 W4 


Figure 5.1.4 Three isomorphic graphs. 


So how should we define “sameness” for graphs? We use a familiar term and definition: 


isomorphism. 


DEFINITION 5.1.4 Suppose G; = (V,£) and Gp = (W, F’). G; and G2 are isomor- 
phic if there is a bijection f:V + W such that {v1, v2} € F if and only if {f(v1), f(ve)} € 
F. In addition, the repetition numbers of {v1, v2} and {f(v1), f(v2)} are the same if mul- 
tiple edges or loops are allowed. This bijection f is called an isomorphism. When G, 


and Gp» are isomorphic, we write G; = Go. 


Each pair of graphs in figure 5.1.4 are isomorphic. For example, to show explicitly 
that G, = G3, an isomorphism is 


f(v1) = w3 
f(v2) = wa 
f(us) = we 
f(va) = wr 


Clearly, if two graphs are isomorphic, their degree sequences are the same. The con- 
verse is not true; the graphs in figure 5.1.5 both have degree sequence 1,1, 1,2, 2,3, but in 
one the degree-2 vertices are adjacent to each other, while in the other they are not. In 
general, if two graphs are isomorphic, they share all “graph theoretic” properties, that is, 
properties that depend only on the graph. As an example of a non-graph theoretic prop- 
erty, consider “the number of times edges cross when the graph is drawn in the plane.” 


In a more or less obvious way, some graphs are contained in others. 


DEFINITION 5.1.5 Graph H = (W, F’) isa subgraph of graph G = (V, E£) ifW CV 
and Ff C E. (Since H is a graph, the edges in F' have their endpoints in W.) H is an 
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Figure 5.1.5 Non-isomorphic graphs with degree sequence 1,1, 1, 2, 2, 3. 
v2 V1 V2 V4 V2 
U4 U4 V4 


Figure 5.1.6 Left to right: a graph, a subgraph, an induced subgraph. 


U1 


U3 


induced subgraph if F consists of all edges in F with endpoints in W. See figure 5.1.6. 


Whenever U C V we denote the induced subgraph of G on vertices U as G[U]. 


A path in a graph is a subgraph that is a path; if the endpoints of the path are v and 


w we say it is a path from v to w. A cycle in a graph is a subgraph that is a cycle. A 


clique in a graph is a subgraph that is a complete graph. 


If a graph G is not connected, define v ~ w if and only if there is a path connecting 


v and w. It is not hard to see that this is an equivalence relation. Each equivalence 


class corresponds to an induced subgraph G; these subgraphs are called the connected 


components of the graph. 


Exercises 5.1. 


1. 


The complement G of the simple graph G is a simple graph with the same vertices as G, and 
{v,w} is an edge of G if and only if it is not an edge of G. A graph G is self-complementary 
if G&G. Show that if G is self-complementary then it has 4k or 4k + 1 vertices for some k. 
Find self-complementary graphs on 4 and 5 vertices. 


Prove that if }>"_, di is even, there is a graph (not necessarily simple) with degree sequence 
di, dz,...,dn. 


Suppose d, > dz >--- > dn and )>""_, d; is even. Prove that there is a multigraph (no loops) 
with degree sequence di, d2,...,dn if and only if di < S7i., di. 


4. Prove that 0,1,2,3,4 is not graphical. 


Is 4,4,3,2,2,1,1 graphical? If not, explain why; if so, find a simple graph with this degree 
sequence. 
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Is 4, 4,4, 2,2 graphical? If not, explain why, and find a multigraph (no loops) with this degree 
sequence; if so, find a simple graph with this degree sequence. 


Prove that a simple graph with n > 2 vertices has two vertices of the same degree. 
Prove the “only if” part of theorem 5.1.3. 


Show that the condition on the degrees in theorem 5.1.3 is equivalent to this condition: 
oF, di is even and for all k € {1,2,...,n}, and all {t1,i2,...,%%} C [n], 


k 
Sidi, <k(R-1I+ So min(di,h). 
j=l CE {i1 ,t2,---5t } 
Do not use theorem 5.1.3. 
Draw the 11 non-isomorphic graphs with four vertices. 
Suppose G, ~ G2. Show that if G1 contains a cycle of length k so does Go. 


Define v ~ w if and only if there is a path connecting vertices v and w. Prove that ~ is an 
equivalence relation. 


Prove the “if” part of theorem 5.1.3, as follows: 


The proof is by induction on s = }>"_, d;. This is easy to see if s = 2, so suppose s > 2. 
Without loss of generality we may suppose that d, > 0. Let t be the least integer such that 
di > di41, or t= n— 1 if there is no such integer. Let d; = d; — 1, d}, = dn — 1, and d; = d; 
for all other i. Note that d, > d; > ---d',. We want to show that the sequence {dj} satisfies 
the condition of the theorem, that is, that for all k € {1,2,...,n}, 


n 


k 
Sod; <k(k-1)+ S> min(dij, k). 
i=1 


i=k+1 
There are five cases: 
1.k>t 
-_k<t,de<k 
-k<t,dh=k 
-_k<t,d,>k 
-k<t,dk>k, dn <k 


a Bw bd 


By the induction hypothesis, there is a simple graph with degree sequence {d;}. Finally, 
show that there is a graph with degree sequence {d;}. 

This proof is due to S. A. Choudum, A Simple Proof of the Erdés-Gallai Theorem on 
Graph Sequences, Bulletin of the Australian Mathematics Society, vol. 33, 1986, pp. 67-70. 
The proof by Paul Erdés and Tibor Gallai was long; Berge provided a shorter proof that 
used results in the theory of network flows. Choudum’s proof is both short and elementary. 


5.2 EULER CIRCUITS AND WALKS 


The first problem in graph theory dates to 1735, and is called the Seven Bridges of 


Konigsberg. In Konigsberg were two islands, connected to each other and the mainland 


by seven bridges, as shown in figure 5.2.1. The question, which made its way to Euler, 
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Figure 5.2.1 The Seven Bridges of K6nigsberg. 


was whether it was possible to take a walk and cross over each bridge exactly once; Euler 
showed that it is not possible. 


We can represent this problem as a graph, as in figure 5.2.2. 


Figure 5.2.2 The Seven Bridges of Konigsberg as a graph. 


The two sides of the river are represented by the top and bottom vertices, and the 
islands by the middle two vertices. There are two possible interpretations of the question, 
depending on whether the goal is to end the walk at its starting point. Perhaps inspired 
by this problem, a walk in a graph is defined as follows. 


DEFINITION 5.2.1 A walk in a graph is a sequence of vertices and edges, 


U1, €1, U2, €2,---,Uk, Ck, Uk+1 


98 Chapter 5 Graph Theory 


such that the endpoints of edge e; are v; and v;41. In general, the edges and vertices may 
appear in the sequence more than once. If vy = vz4i1, the walk is a closed walk or a 


circuit. 


We will deal first with the case in which the walk is to start and end at the same place. 
A successful walk in K6nigsberg corresponds to a closed walk in the graph in which every 
edge is used exactly once. 

What can we say about this walk in the graph, or indeed a closed walk in any graph 
that uses every edge exactly once? Such a walk is called an Euler circuit. If there are 
no vertices of degree 0, the graph must be connected, as this one is. Beyond that, imagine 
tracing out the vertices and edges of the walk on the graph. At every vertex other than 
the common starting and ending point, we come into the vertex along one edge and go 
out along another; this can happen more than once, but since we cannot use edges more 
than once, the number of edges incident at such a vertex must be even. Already we see 
that we’re in trouble in this particular graph, but let’s continue the analysis. The common 
starting and ending point may be visited more than once; except for the very first time 
we leave the starting vertex, and the last time we arrive at the vertex, each such visit 
uses exactly two edges. Together with the edges used first and last, this means that the 
starting vertex must also have even degree. Thus, since the Konigsberg Bridges graph has 
odd degrees, the desired walk does not exist. 

The question that should immediately spring to mind is this: if a graph is connected 
and the degree of every vertex is even, is there an Euler circuit? The answer is yes. 


THEOREM 5.2.2 _ If G is a connected graph, then G contains an Euler circuit if and 
only if every vertex has even degree. 


Proof. We have already shown that if there is an Euler circuit, all degrees are even. 

We prove the other direction by induction on the number of edges. If G has no edges 
the problem is trivial, so we assume that G has edges. 

We start by finding some closed walk that does not use any edge more than once: 
Start at any vertex vo; follow any edge from this vertex, and continue to do this at each 
new vertex, that is, upon reaching a vertex, choose some unused edge leading to another 
vertex. Since every vertex has even degree, it is always possible to leave a vertex at which 
we arrive, until we return to the starting vertex, and every edge incident with the starting 
vertex has been used. The sequence of vertices and edges formed in this way is a closed 
walk; if it uses every edge, we are done. 

Otherwise, form graph G’ by removing all the edges of the walk. G’ is not connected, 
since vertex vg is not incident with any remaining edge. The rest of the graph, that is, G’ 
without vp, may or may not be connected. It consists of one or more connected subgraphs, 
each with fewer edges than G; call these graphs G1, Go,...,G,. Note that when we remove 
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the edges of the initial walk, we reduce the degree of every vertex by an even number, so 
all the vertices of each graph G; have even degree. By the induction hypothesis, each G; 
has an Euler circuit. These closed walks together with the original closed walk use every 
edge of G exactly once. 


Suppose the original closed walk is v9,v1,...,Um = Uo, abbreviated to leave out the 
edges. Because G is connected, at least one vertex in each G; appears in this sequence, 
say vertices wi; € Gi, Wo1 € Go,..., Wea © Gx, listed in the order they appear in 
U0, U1,+++,;Um- The Euler circuits of the graphs G; are 

W115 W1,2)---)Wiym, = W111 
W2,1,W2,2,+++,W2m2 = W2,1 
Wk,1, Wk,2)+++,Wk,m, = Wk,1- 


By pasting together the original closed walk with these, we form a closed walk in G that 
uses every edge exactly once: 


U0s U1, +++) Vi, = W1,1,W1,2,--+, Wim, = Viz, Viz+1; 
1+) Vig = W2)1,--+)W2,me = Vig; Vig+1; 
2+ Vin = Wk,1,-- -, Wkymp = Vins Vigtlo-+> 5 Um = Vo- 


Now let’s turn to the second interpretation of the problem: is it possible to walk over 
all the bridges exactly once, if the starting and ending points need not be the same? In a 
graph G, a walk that uses all of the edges but is not an Euler circuit is called an Euler 
walk. It is not too difficult to do an analysis much like the one for Euler circuits, but it 
is even easier to use the Euler circuit result itself to characterize Euler walks. 


THEOREM 5.2.3 A connected graph G has an Euler walk if and only if exactly two 
vertices have odd degree. 


Proof. Suppose first that G has an Euler walk starting at vertex v and ending at vertex 
w. Add a new edge to the graph with endpoints v and w, forming G’. G’ has an Euler 
circuit, and so by the previous theorem every vertex has even degree. The degrees of v 
and w in G are therefore odd, while all others are even. 

Now suppose that the degrees of v and w in G are odd, while all other vertices have 
even degree. Add a new edge e to the graph with endpoints v and w, forming G’. Every 
vertex in G’ has even degree, so by the previous theorem there is an Euler circuit which 
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we can write as 


V, €1, U2, €2,---,W,€,U 
so that 
V, €1, V2, €2,---,W 
is an Euler walk. a 
Exercises 5.2. 
1. Suppose a connected graph G has degree sequence dj, d2,...,dn. How many edges must be 


added to G so that the resulting graph has an Euler circuit? Explain. 


2. Which complete graphs K,, n > 2, have Euler circuits? Which have Euler walks? Justify 
your answers. 


3. Prove that if vertices v and w are joined by a walk they are joined by a path. 


4. Show that if G is connected and has exactly 2k vertices of odd degree, k > 1, its edges can 
be partitioned into k walks. Is this true for non-connected G? 


5.38 HAMILTON CYCLES AND PATHS 


Here is a problem similar to the K6nigsberg Bridges problem: suppose a number of cities 
are connected by a network of roads. Is it possible to visit all the cities exactly once, 
without traveling any road twice? We assume that these roads do not intersect except at 
the cities. Again there are two versions of this problem, depending on whether we want to 
end at the same city in which we started. 

This problem can be represented by a graph: the vertices represent cities, the edges 
represent the roads. We want to know if this graph has a cycle, or path, that uses every 
vertex exactly once. (Recall that a cycle in a graph is a subgraph that is a cycle, and a 
path is a subgraph that is a path.) There is no benefit or drawback to loops and multiple 
edges in this context: loops can never be used in a Hamilton cycle or path (except in the 
trivial case of a graph with a single vertex), and at most one of the edges between two 
vertices can be used. So we assume for this discussion that all graphs are simple. 


DEFINITION 5.3.1 A cycle that uses every vertex in a graph exactly once is called 
a Hamilton cycle, and a path that uses every vertex in a graph exactly once is called a 


Hamilton path. 


Unfortunately, this problem is much more difficult than the corresponding Euler circuit 
and walk problems; there is no good characterization of graphs with Hamilton paths and 
cycles. Note that if a graph has a Hamilton cycle then it also has a Hamilton path. 

There are some useful conditions that imply the existence of a Hamilton cycle or path, 
which typically say in some form that there are many edges in the graph. An extreme 
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example is the complete graph K,,: it has as many edges as any simple graph on n vertices 
can have, and it has many Hamilton cycles. 

The problem for a characterization is that there are graphs with Hamilton cycles that 
do not have very many edges. The simplest is a cycle, C,,: this has only n edges but has a 
Hamilton cycle. On the other hand, figure 5.3.1 shows graphs with just a few more edges 
than the cycle on the same number of vertices, but without Hamilton cycles. 


Figure 5.3.1 <A graph with a Hamilton path but not a Hamilton cycle, and one with 
neither. 


There are also graphs that seem to have many edges, yet have no Hamilton cycle, as 
indicated in figure 5.3.2. 


Figure 5.3.2 <A graph with many edges but no Hamilton cycle: a complete graph Ky ~1 
joined by an edge to a single vertex. This graph has Ca) + 1 edges. 


The key to a successful condition sufficient to guarantee the existence of a Hamilton 
cycle is to require many edges at lots of vertices. 


THEOREM 5.3.2 (Ore) If G is a simple graph on n vertices, n > 3, and d(v) + 
d(w) >n whenever v and w are not adjacent, then G has a Hamilton cycle. 


Proof. First we show that G is connected. If not, let v and w be vertices in two different 
connected components of G, and suppose the components have n; and nz vertices. Then 
d(v) <n, — 1 and d(w) < ng —1, so d(v) + d(w) < ni + ng -—2 <n. But since v and w 
are not adjacent, this is a contradiction. 


Now consider a longest possible path in G: v1, v2,...,Uz. Suppose, for a contradiction, 
that k < n, so there is some vertex w adjacent to one of v2,3,...,Uz—1, Say to v;. If 
v1 is adjacent to vz, then w, v;,Ui41,---,Uk,V1,V2,;---Us-1 iS a path of length k+1, a 


contradiction. Hence, v; is not adjacent to vz, and so d(v,) + d(v,) > n. The neighbors of 
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vy, are among {v2,U3,...,Uz—1} as are the neighbors of vz. Consider the vertices 

W = {vi41 | vi is a neighbor of vz}. 
Then |N(v,)| = |W| and W C {v3,v4,..., ue} and N(v1) © {ve2,u3,...,ve—-1}, so WU 
N(v1) C {ve, v3,..-, Ue}, a set with k — 1 <n elements. Since |N(v1)| + |W] = |N(v1)| + 
|N(vz)| =n, N(v1) and W must have a common element, v;; note that 3 < 7 < k-1. 
Then this is a cycle of length k: 

U1, Uj, Ujtis+++5 Uk, Uj—-1, Uj—-2,+++,U1- 
We can relabel the vertices for convenience: 

U1 = W1, W2,--.-,Wk = V2, U1- 

Now as before, w is adjacent to some w;, and w, wi, Wi41,---; Wk, W1, W2,--.Wi-1 is a path 
of length k+1, acontradiction. Thus, k = n, and, renumbering the vertices for convenience, 
we have a Hamilton path v1, v2,...,Un. If vy is adjacent to v,, there is a Hamilton cycle, 
as desired. 


If v1 is not adjacent to vp, the neighbors of v1 are among {v2,v3,...,Un—1} as are the 
neighbors of v,. Consider the vertices 


W = {vi41 | y is a neighbor of v,}. 


Then |N(v,)| = |W| and W C {v3,u4,...,Un}, and N(v1) C {ve,v3,...,Un—1}, so W U 
N(v1) C {v2, u3,---, Un}, a set with n — 1 <n elements. Since |N(v1)| + |W] = |N(e1)| + 
|N(vn)| > n, N(v1) and W must have a common element, v;; note that 3 <i<n-l1. 
Then this is a cycle of length n: 


U1, Vi, Vi41,---, Uk, Vi-1, Vi-2,---,U1; 


and is a Hamilton cycle. = 


The property used in this theorem is called the Ore property; if a graph has the Ore 
property it also has a Hamilton path, but we can weaken the condition slightly if our goal 
is to show there is a Hamilton path. The proof of this theorem is nearly identical to the 
preceding proof. 


THEOREM 5.3.3 If Gis a simple graph on n vertices and d(v)+d(w) > n—1 whenever 
v and w are not adjacent, then G has a Hamilton path. = 


Suppose G is not simple. The existence of multiple edges and loops can’t help produce 
a Hamilton cycle when n > 3: if we use a second edge between two vertices, or use a loop, 
we have repeated a vertex. To extend the Ore theorem to multigraphs, we consider the 
condensation of G: When n > 3, the condensation of G is simple, and has a Hamilton 
cycle if and only if G has a Hamilton cycle. So if the condensation of G satisfies the Ore 
property, then G has a Hamilton cycle. 
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Exercises 5.3. 


—1)\(n-2 
1. Suppose a simple graph G on n vertices has at least (wa Din=?) +2 edges. Prove that G 
ao ax (n= In 2) 
has a Hamilton cycle. For n > 2, show that there is a simple graph with 5 + 1 


edges that has no Hamilton cycle. 
2. Prove theorem 5.3.3. 


3. The graph shown below is the Petersen graph. Does it have a Hamilton cycle? Justify your 
answer. Does it have a Hamilton path? Justify your answer. 


5.4 BIPARTITE GRAPHS 


We have already seen how bipartite graphs arise naturally in some circumstances. Here 
we explore bipartite graphs a bit more. 

It is easy to see that all closed walks in a bipartite graph must have even length, 
since the vertices along the walk must alternate between the two parts. Remarkably, the 


converse is true. We need one new definition: 


DEFINITION 5.4.1 The distance between vertices v and w, d(v,w), is the length of 
a shortest walk between the two. If there is no walk between v and w, the distance is 


undefined. 


THEOREM 5.4.2. G is bipartite if and only if all closed walks in G are of even length. 


Proof. The forward direction is easy, as discussed above. 

Now suppose that all closed walks have even length. We may assume that G is con- 
nected; if not, we deal with each connected component separately. 

Let v be a vertex of G, let X be the set of all vertices at even distance from v, and Y 
be the set of vertices at odd distance from v. We claim that all edges of G join a vertex 
of X to a vertex of Y. Suppose not; then there are adjacent vertices u and w such that 
d(v, wu) and d(v, w) have the same parity. Then there is a closed walk from v to u to w to 
v of length d(v, u) + 1+ d(v,w), which is odd, a contradiction. | 
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The closed walk that provides the contradiction is not necessarily a cycle, but this can 
be remedied, providing a slightly different version of the theorem. 


COROLLARY 5.4.3 G is bipartite if and only if all cycles in G are of even length. 


Proof. Again the forward direction is easy, and again we assume G is connected. As 
before, let v be a vertex of G, let X be the set of all vertices at even distance from v, 
and Y be the set of vertices at odd distance from v. If two vertices in X are adjacent, or 
two vertices in Y are adjacent, then as in the previous proof, there is a closed walk of odd 
length. 

To finish the proof, it suffices to show that if there is a closed walk W of odd length 
then there is a cycle of odd length. The proof is by induction on the length of the closed 
walk. 

If W has no repeated vertices, we are done. Otherwise, suppose the closed walk is 


VU = U1, €1,.--, Uj = U,.--, Uk = VU = U1. 


Then 


C0 ves = Bnd CSO6. Use eS 0 


are closed walks, both are shorter than the original closed walk, and one of them has odd 
length. By the induction hypothesis, there is a cycle of odd length. r 


It is frequently fruitful to consider graph properties in the limited context of bipartite 
graphs (or other special types of graph). For example, what can we say about Hamilton 
cycles in simple bipartite graphs? Suppose the partition of the vertices of the bipartite 
graph is X and Y. Because any cycle alternates between vertices of the two parts of the 
bipartite graph, if there is a Hamilton cycle then |X| = |Y| > 2. In such a case, the degree 
of every vertex is at most n/2, where n is the number of vertices, namely n = |X| + |Y]. 
Thus the Ore condition (d(v) + d(w) > n when v and w are not adjacent) is equivalent 
to d(v) = n/2 for all v. This means the only simple bipartite graph that satisfies the Ore 
condition is the complete bipartite graph K,,/2,,/2, in which the two parts have size 
n/2 and every vertex of X is adjacent to every vertex of Y. The upshot is that the Ore 
property gives no interesting information about bipartite graphs. 

Of course, as with more general graphs, there are bipartite graphs with few edges and 
a Hamilton cycle: any even length cycle is an example. 

We note that, in general, a complete bipartite graph K,,,, is a bipartite graph with 
|X| =m, |Y| =n, and every vertex of X is adjacent to every vertex of Y. The only such 
graphs with Hamilton cycles are those in which m = n. 
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Exercises 5.4. 


1. Prove that there is a bipartite multigraph with degree sequence dj,...,dn if and only if there 
is a partition {J, J} of [n] such that 
a= Ta 


ie] ied 

2. What is the smallest number of edges that can be removed from Ks to create a bipartite 
graph? 

3. A regular graph is one in which the degree of every vertex is the same. Show that if G is 


a regular bipartite graph, and the common degree of the vertices is at least 1, then the two 
parts are the same size. 


4. A perfect matching is one in which all vertices of the graph are incident with exactly one 
edge in the matching. Show that a regular bipartite graph with common degree at least 1 
has a perfect matching. (We discussed matchings in section 4.5.) 


5.5 TREES 
Another useful special class of graphs: 


DEFINITION 5.5.1 <A connected graph G is a tree if it is acyclic, that is, it has no 
cycles. More generally, an acyclic graph is called a forest. 


Two small examples of trees are shown in figure 5.1.5. Note that the definition implies 
that no tree has a loop or multiple edges. 


THEOREM 5.5.2 Every tree T' is bipartite. 


Proof. Since T has no cycles, it is true that every cycle of T’ has even length. By 
corollary 5.4.3, T’ is bipartite. = 


DEFINITION 5.5.3 A vertex of degree one is called a pendant vertex, and the edge 
incident to it is a pendant edge. o 


THEOREM 5.5.4 _ Every tree on two or more vertices has at least one pendant vertex. 


Proof. We prove the contrapositive. Suppose graph G has no pendant vertices. Starting 
at any vertex v, follow a sequence of distinct edges until a vertex repeats; this is possible 
because the degree of every vertex is at least two, so upon arriving at a vertex for the first 
time it is always possible to leave the vertex on another edge. When a vertex repeats for 
the first time, we have discovered a cycle. rT 


This theorem often provides the key step in an induction proof, since removing a 
pendant vertex (and its pendant edge) leaves a smaller tree. 
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THEOREM 5.5.5 _ A tree on n vertices has exactly n — 1 edges. 


Proof. A tree on 1| vertex has 0 edges; this is the base case. 

If T is a tree on n > 2 vertices, it has a pendant vertex. Remove this vertex and its 
pendant edge to get a tree JT” on n—1 vertices. By the induction hypothesis, T’ has n — 2 
edges; thus T’ has n — 1 edges. rT] 


THEOREM 5.5.6 _ A tree with a vertex of degree k > 1 has at least k pendant vertices. 
In particular, every tree on at least two vertices has at least two pendant vertices. 


Proof. The case k = 1 is obvious. Let 7’ be a tree with n vertices, degree sequence 
{d;}7_,, and a vertex of degree k > 2, and let | be the number of pendant vertices. 
Without loss of generality, 1 = dj = dg =--- = d; and dj4; =k. Then 


n 


An-1)=Sods=l+k+ So dj >l+k+2(n—-1-1). 


i=l t=142 


This reduces to | > k, as desired. 

If T is a tree on two vertices, each of the vertices has degree 1. If T’ has at least three 
vertices it must have a vertex of degree k > 2, since otherwise 2(n — 1) = Sy, di =n, 
which implies n = 2. Hence it has at least k > 2 pendant vertices. | 


Trees are quite useful in their own right, but also for the study of general graphs. 


DEFINITION 5.5.7 If G is a connected graph on n vertices, a spanning tree for G 
is a subgraph of G that is a tree on n vertices. 


THEOREM 5.5.8 Every connected graph has a spanning tree. 


Proof. By induction on the number of edges. If G is connected and has zero edges, it is 
a single vertex, so G is already a tree. 

Now suppose G' has m > 1 edges. If G is a tree, it is its own spanning tree. Otherwise, 
G contains a cycle; remove one edge of this cycle. The resulting graph G’ is still connected 
and has fewer edges, so it has a spanning tree; this is also a spanning tree for G. = 


In general, spanning trees are not unique, that is, a graph may have many spanning 
trees. It is possible for some edges to be in every spanning tree even if there are multiple 
spanning trees. For example, any pendant edge must be in every spanning tree, as must 
any edge whose removal disconnects the graph (such an edge is called a bridge.) 


COROLLARY 5.5.9 If G is connected, it has at least n — 1 edges; moreover, it has 
exactly n — 1 edges if and only if it is a tree. 
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Proof. If G is connected, it has a spanning tree, which has n — 1 edges, all of which are 
edges of G. 
If G has n—1 edges, which must be the edges of its spanning tree, thenGisatree. & 


THEOREM 5.5.10 G is a tree if and only if there is a unique path between any two 


vertices. 


Proof. 

if: Since every two vertices are connected by a path, G is connected. For a contradiction, 
suppose there is a cycle in G; then any two vertices on the cycle are connected by at least 
two distinct paths, a contradiction. 

only if: If G is a tree it is connected, so between any two vertices there is at least one 
path. For a contradiction, suppose there are two different paths from v to w: 


U= 0, Vac, SW and B= Wi, Wes .2. 5 wy — we. 


Let 7 be the smallest integer such that v; 4 w;. Then let 7 be the smallest integer greater 
than or equal to i such that w; = v,, for some m, which must be at least i. (Since w; = vp, 


such an m must exist.) Then 14-1, Ui, ...,Um = Wj, Wj-1,---,Wi-1 = Ui-1 is a cycle in G, 
a contradiction. See figure 5.5.1. a 
Wj 
U1 U2 U3 Um Uk 
a a ) e 
Wi Wa W3 WI 
Wi Wj—1 


Figure 5.5.1 Distinct paths imply the existence of a cycle. 


DEFINITION 5.5.11 A cutpoint in a connected graph G is a vertex whose removal 
disconnects the graph. 


THEOREM 5.5.12 Every connected graph has a vertex that is not a cutpoint. 


Proof. Remove a pendant vertex in a spanning tree for the graph. a 


Exercises 5.5. 


1. Suppose that G is a connected graph, and that every spanning tree contains edge e. Show 
that e is a bridge. 


2. Show that every edge in a tree is a bridge. 
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3. Show that G is a tree if and only if it has no cycles and adding any new edge creates a graph 
with exactly one cycle. 


4. Which trees have Euler walks? 

5. Which trees have Hamilton paths? 

6. Let n > 2. Show that there is a tree with degree sequence dj, d2,..., dy if and only if d; > 0 
for-all ¢ and. 57)", d= 2(~ 1). 

7. A multitree is a multigraph whose condensation is a tree. Let n > 2. Let di,do,...,dn be 
positive integers, and let g be the greatest common divisor of the d;. Show that there is 
a multitree with degree sequence dj, d2,...,dn if and only if )>", di/g > 2(n — 1) and for 
some partition I, J of [n], 0,-,di = Vie di- 


8. Suppose T is a tree on n vertices, k of which have degree larger than 1, di, d2,...d,x. Of 
course, 7’ must also have pendant vertices. How many pendant vertices? Your answer should 
depend only on k and dj, d2,... dx. 


5.6 OPTIMAL SPANNING TREES 


In some applications, a graph G is augmented by associating a weight or cost with each 
edge; such a graph is called a weighted graph. For example, if a graph represents a 
network of roads, the weight of an edge might be the length of the road between its two 
endpoints, or the amount of time required to travel from one endpoint to the other, or 
the cost to bury cable along the road from one end to the other. In such cases, instead of 
being interested in just any spanning tree, we may be interested in a least cost spanning 
tree, that is, a spanning tree such that the sum of the costs of the edges of the tree is as 
small as possible. For example, this would be the least expensive way to connect a set of 
towns by a communication network, burying the cable in such a way as to minimize the 
total cost of laying the cable. 

This problem is one that can be solved by a greedy algorithm. Roughly speaking, 
a greedy algorithm is one that makes choices that are optimal in the short run. Typically 
this strategy does not result in an optimal solution in the long run, but in this case this 
approach works. 


DEFINITION 5.6.1 A weighted graph is a graph G together with a cost function 
c: E(G) + R*°. If H is a subgraph of G, the cost of H is c(H) = ee R(H) Cle): 


The Jarnik Algorithm. Given a weighted connected graph G, we construct a minimum 
cost spanning tree JT as follows. Choose any vertex vg in G and include it in T. If 
vertices S = {vo,v1,...,vx%} have been chosen, choose an edge with one endpoint in S and 
one endpoint not in S and with smallest weight among all such edges. Let vz41 be the 
endpoint of this edge not in S, and add it and the associated edge to T. Continue until 
all vertices of G are in T. 
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This algorithm was discovered by Vojtéch Jarnik in 1930, and rediscovered indepen- 
dently by Robert C. Prim in 1957 and Edsger Dijkstra in 1959. It is often called Prim’s 
Algorithm. 

The algorithm proceeds by constructing a sequence of trees JT), 7>,...,Tn—1, with T,_-1 
a spanning tree for G. At each step, the algorithm adds an edge that will make c(7;,,) as 
small as possible among all trees that consist of T; plus one edge. This is the best choice 
in the short run, but it is not obvious that in the long run, that is, by the time T),_1 is 
constructed, that this will turn out to have been the best choice. 


THEOREM 5.6.2 The Jarnik Algorithm produces a minimum cost spanning tree. 


Proof. Suppose G is connected on n vertices. Let T’ be the spanning tree produced by 
the algorithm, and T,,, a minimum cost spanning tree. We prove that c(T’) = c(Tin). 

Let €1,€2,..-,€n—1 be the edges of T in the order in which they were added to T; 
one endpoint of e; is v;, the other is in {vo,...,vi-1}. We form a sequence of trees 
Tm = To,Ti,.-.,;Tn—-1 = T such that for each i, c(T;) = c(Tj41), and we conclude that 
Chaar: 

If e, isin To, let 7; = Tp. Otherwise, add edge e; to Tp. This creates a cycle containing 
€, and another edge incident at vo, say f;. Remove f; to form T,. Since the algorithm 
added edge e1, c(e1) < c(fi). If c(e1) < e(fi), then c(T1) < c(To) = c(Tim), a contradiction, 
so c(e,) = c(f,) and c(T1) = c(To). 

Suppose we have constructed tree 7;. If e;41 is in Jj, let Tj41 = T;. Otherwise, add 
edge e;41 to T;. This creates a cycle, one of whose edges, call it fj+1, is not in e1, €2,..., €; 
and has exactly one endpoint in {vg,...,v;}. Remove fj, to create T;11. Since the 
algorithm added ej41, c(ei4i) < c(fizi). If cleizi) < c(fi4i), then c(Ti4i) < c(T;) = 


c(Tm), a contradiction, so c(e;41) = c(fi¢i1) and c(Tj41) = c(Tj). . 


Exercises 5.6. 


1. Kruskal’s Algorithm is also a greedy algorithm that produces a minimum cost spanning tree 
for a connected graph G. Begin by choosing an edge in G of smallest cost. Assuming that 
edges €1, €2,...,e; have been chosen, pick an edge e;41 that does not form a cycle together 
with e1,¢2,...,e; and that has smallest cost among all such edges. The edges €1, €2,...,€n—1 
form a spanning tree for G. Prove that this spanning tree has minimum cost. 


2. Prove that if the edge costs of G are distinct, there is exactly one minimum cost spanning 
tree. Give an example of a graph G with more than one minimum cost spanning tree. 


3. In both the Jarnik and Kruskal algorithms, it may be that two or more edges can be added 
at any particular step, and some method is required to choose one over the other. For the 
graph below, use both algorithms to find a minimum cost spanning tree. Using the labels 
e; on the graph, at each stage pick the edge e; that the algorithm specifies and that has the 
lowest possible 2 among all edges available. For the Jarnik algorithm, use the designated 
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vo as the starting vertex. For each algorithm, list the edges in the order in which they are 
added. The edge weights e1,e2,...,€10 are 6, 7,8, 2,3, 2, 4,6, 1,1, shown in red. 


e€6/2 


€10/1 


5.7 CONNECTIVITY 


We have seen examples of connected graphs and graphs that are not connected. While “not 
connected” is pretty much a dead end, there is much to be said about “how connected” 
a connected graph is. The simplest approach is to look at how hard it is to disconnect a 
graph by removing vertices or edges. We assume that all graphs are simple. 

If it is possible to disconnect a graph by removing a single vertex, called a cutpoint, 
we say the graph has connectivity 1. If this is not possible, but it is possible to disconnect 
the graph by removing two vertices, the graph has connectivity 2. 


DEFINITION 5.7.1 _ If a graph G is connected, any set of vertices whose removal 
disconnects the graph is called a cutset. G has connectivity k if there is a cutset of size k 
but no smaller cutset. If there is no cutset and G has at least two vertices, we say G has 
connectivity n —1; if G has one vertex, its connectivity is undefined. If G is not connected, 
we say it has connectivity 0. G is k-connected if the connectivity of G is at least k. The 


connectivity of G is denoted «(G). 


As you should expect from the definition, there are graphs without a cutset: the 
complete graphs K,. If G is connected but not a K,,, it has vertices v and w that are 
not adjacent, so removing the n — 2 other vertices leaves a non-connected graph, and so 
the connectivity of G is at most n — 2. Thus, only the complete graphs have connectivity 
n—1. 

We do the same thing for edges: 


DEFINITION 5.7.2 If a graph G is connected, any set of edges whose removal dis- 
connects the graph is called a cut. G has edge connectivity k if there is a cut of size k 
but no smaller cut; the edge connectivity of a one-vertex graph is undefined. G is k-edge- 


5.7 Connectivity 111 


connected if the edge connectivity of G is at least k. The edge connectivity is denoted 


NG): 


Any connected graph with at least two vertices can be disconnected by removing 
edges: by removing all edges incident with a single vertex the graph is disconnected. 
Thus, A(G) < 6(G), where 6(G) is the minimum degree of any vertex in G. Note that 
6(G) <n—-1, 80 A(G) <n-1. 

Removing a vertex also removes all of the edges incident with it, which suggests that 
kK(G) < (G). This turns out to be true, though not as easy as you might hope. We write 
G — v to mean G with vertex v removed, and G — {v1, v2,...,uz%} to mean G with all of 
{v1,V2,---,Ux} removed, and similarly for edges. 


THEOREM 5.7.3 «(G) < X(G). 


Proof. We use induction on A = X(G). If A = 0, G is disconnected, so k = 0. If A = 1, 
removal of edge e with endpoints v and w disconnects G. If v and w are the only vertices 
of G, G is Kg and has connectivity 1. Otherwise, removal of one of v and w disconnects 
G,son=1. 

As a special case we note that if \=n-—1 thend=n-—1,soGis K, andkt=n-1. 

Now suppose n — 1 > A =k > 1, and removal of edges €1,e€2,...,e% disconnects G. 
Remove edge e, with endpoints v and w to form G; with \(G,) = k—1. By the induction 
hypothesis, there are at most k—1 vertices v1, v2,...,v; such that Gy = G1—{v1, ve,...,u,;} 
is disconnected. Since k <n —1, k—1<n-—83, and so G2 has at least 3 vertices. 

If both v and w are vertices of Gz, and if adding e, to Gz produces a connected 
graph G3, then removal of one of v and w will disconnect G3 forming G4, and G4 = 
G — {v}, v2,...,U;,v} or G4 = G— {v1, v2,...,;,w}, that is, removing at most k vertices 
disconnects G. If v and w are vertices of Gz but adding e, does not produce a connected 
graph, then removing v1, v2,...,v; disconnects G. Finally, if at least one of v and w is not 
in Gg, then Gy = G — {vj,v2,...,v,;} and the connectivity of G is less than k. So in all 
cases, Kk <k. | 


Graphs that are 2-connected are particularly important, and the following simple the- 


orem is useful. 


THEOREM 5.7.4 _ If G has at least three vertices, the following are equivalent: 
1. G is 2-connected 
2. G is connected and has no cutpoint 


3. For all distinct vertices u, v, w in G there is a path from u to v that does not 


contain w. 
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Proof. 1 = 3: Since G is 2-connected, G with w removed is a connected graph G’. 
Thus, in G’ there is a path from u to v, which in G is a path from u to v avoiding w. 

3 => 2: If G has property 3 it is clearly connected. Suppose that w is a cutpoint, so that 
G’ = G— w is disconnected. Let u and v be vertices in two different components of G’, 
so that no path connects them in G’. Then every path joining u to v in G must use w, a 
contradiction. 

2 = 1: Since G has at least 3 vertices and has no cutpoint, its connectivity is at least 2, 
so it is 2-connected by definition. a 


There are other nice characterizations of 2-connected graphs. 


THEOREM 5.7.5 If G has at least three vertices, then G is 2-connected if and only if 


every two vertices u and v are contained in a cycle. 


Proof. 

if: Suppose vertex w is removed from G, and consider any other vertices u and v. In G, u 
and v lie on a cycle; even if w also lies on this cycle, then u and v are still connected by a 
path when w is removed. 

only if: Given wu and v we want to show there is a cycle containing both. Let U be 
the set of vertices other than u that are contained in a cycle with u. First, we show that 
U is non-empty. Let w be adjacent to u, and remove the edge e between them. Since 
A(G) > K(G) > 2, G —e is connected. Thus, there is a path from u to w; together with e 
this path forms a cycle containing u and w, so w € U. 

For a contradiction, suppose v ¢ U. Let w be in U with d(w,v) > 1 as small as 
possible, fix a cycle C containing u and w and a path P of length d(w,v) from w to v. 
By the previous theorem, there is a path Q from u to v that does not use w. Following 
this path from u, there is a last vertex x on the path that is also on the cycle containing 
u and w, and there is a first vertex y on the path, after x, with y also on the path from 
w to v (it is possible that y = v, but not that y = w); see figure 5.7.1. Now starting at u, 
proceeding on cycle C to x without using w, then from x to y on Q, then to w on P, and 
finally back to u on C, we see that y € U. But y is closer to v than is w, a contradiction. 
Hence v € U. a 


The following corollary is an easy restatement of this theorem. 
COROLLARY 5.7.6 If G has at least three vertices, then G is 2-connected if and only 


if between every two vertices u and v there are two internally disjoint paths, that is, 
paths that share only the vertices u and v. rT 


This version of the theorem suggests a generalization: 
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Figure 5.7.1 Point y closer to v than w is a contradiction; path Q is shown dashed. (See 
theorem 5.7.5.) 


THEOREM 5.7.7 Menger’s Theorem _ If G has at least k +1 vertices, then G is 
k-connected if and only if between every two vertices u and v there are k pairwise internally 


disjoint paths. 


We first prove Menger’s original version of this, a “local” version. 
y] 


DEFINITION 5.7.8 If v and w are non-adjacent vertices in G, Kg(v, w) is the smallest 
number of vertices whose removal separates v from w, that is, disconnects G leaving v and 
w in different components. A cutset that separates v and w is called a separating set 
for v and w. pg(v,w) is the maximum number of internally disjoint paths between v and 


W. 


THEOREM 5.7.9 _ If v and w are non-adjacent vertices in G, Kg(v, w) = pa(v, w). 


Proof. If there are k internally disjoint paths between v and w, then any set of vertices 
whose removal separates v from w must contain at least one vertex from each of the k 
paths, so Kg(v, w) > pe(v, w). 

To finish the proof, we show that there are Kg(v, w) internally disjoint paths between 
v and w. The proof is by induction on the number of vertices in G. If G has two vertices, 
G is not connected, and kg(v,w) = pa(v,w) = 0. Now suppose G has n > 2 vertices and 
Rea (0,0) = ke 

Note that removal of either N(v) or N(w) separates v from w, so no separating set S$ 
of size k can properly contain N(v) or N(w). Now we address two cases: 
Case 1: Suppose there is a set S of size k that separates v from w, and S contains a vertex 
not in N(v) or N(w). G—S is disconnected, and one component G'; contains v. Since S 
does not contain N(v), Gi has at least two vertices; let X = V(Gi) and Y = V(G)-—S-X. 
Since S does not contain N(w), Y contains at least two vertices. Now we form two new 
graphs: Form Hx by starting with G—Y and adding a vertex y adjacent to every vertex 
of S. Form Hy by starting with G — X and adding a vertex x adjacent to every vertex 
of S; see figure 5.7.2. Since X and Y each contain at least two vertices, Hx and Hy are 
smaller than G, and so the induction hypothesis applies to them. 
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Clearly S separates v from y in Hx and w from x in Hy. Moreover, any set that 
separates vu from y in Hx separates v from w in G, so ky,.(v,y) = kG(v,w) =k. Similarly, 
KHy(£2,w) = KG(v,w) = k. Hence, by the induction hypothesis, there are k internally 
disjoint paths from v to y in Hx and k internally disjoint paths from x to w in Hy. Each 
of these paths uses one vertex of S; by eliminating x and y and joining the paths at the 
vertices of S', we produce k internally disjoint paths from v to w. 


Figure 5.7.2 Case 1: Top figure is G, lower left is Hx, lower right is Hy. 


Case 2: Now we suppose that any set S separating v and w is a subset of N(v) U N(w); 
pick such an S. If there is a vertex u not in {v,w}U N(v) U N(w), consider G — u. This 
u is not in any set of size k that separates v from w, for if it were we would be in Case 
1. Since S separates v from w in G — u, KG—u(v,w) < k. But if some smaller set S’ 
separates v from w in G — u, then S’U {u} separates v from w in G, a contradiction, so 
KG—u(v, w) = k. By the induction hypothesis, there are k internally disjoint paths from v 
to w in G— wu and hence in G. 

We are left with V(G) = {v,w} U N(v) U N(w). Suppose there is a vertex u in 
N(v) 1 N(w). Then wu is in every set that separates v from w, so Kg_y = k — 1. By the 
induction hypothesis, there are k — 1 internally disjoint paths from v to w in G — u and 
together with the path v, u, w, they comprise k& internally disjoint paths from v to w in G. 

Finally, suppose that N(v) M N(w) = @. Form a bipartite graph B with vertices 
N(v)U N(w) and any edges of G that have one endpoint in N(v) and the other in N(w). 
Every set separating v from w in G must include one endpoint of every edge in B, that is, 
must be a vertex cover in B, and conversely, every vertex cover in B separates v from w 
in G. Thus, the minimum size of a vertex cover in B is k, and so there is a matching in B 
of size k, by theorem 4.5.6. The edges of this matching, together with the edges incident 
at v and w, form k internally disjoint paths from v to w in G. = 
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Proof of Menger’s Theorem (5.7.7). | Suppose first that between every two vertices 
v and w in G there are k internally disjoint paths. If G is not k-connected, the connectivity 
of G is at most k — 1, and because G has at least k + 1 vertices, there is a cutset S of G 
with size at most k — 1. Let v and w be vertices in two different components of G — S; in 
G these vertices are joined by k internally disjoint paths. Since there is no path from v to 
w in G—S, each of these k paths contains a vertex of S, but this is impossible since S has 
size less than k, and the paths share no vertices other than v and w. This contradiction 
shows that G is k-connected. 

Now suppose G is k-connected. 

If v and w are not adjacent, Kg(v,w) > k and by the previous theorem there are 
pa(v, w) = kg(v, w) internally disjoint paths between v and w. 

If v and w are connected by edge e, consider G — e. Suppose there is a separating set 
S for v and w with |S| < k—1, that is, k¢_-(v,w) < k—1. S cannot be a cutset for G, 
so G—S is connected. Then G —e—S must have exactly two components, since G — S is 
G—e-—S plus a single edge. G—e—S has at least k + 1 — (k — 2) = 3 vertices, so one 
of the two components contains at least two vertices; without loss of generality, say u and 
w are in a single component. Then G — (SU {w}) is not connected, but |S U {w}| < k, 
contradicting the assumption that G is k-connected. Thus, kg_-(v,w) > k — 1, so there 
are k — 1 internally disjoint paths between v and w in G—e. Together with the path from 
v to w using e, this gives k internally disjoint paths between v and w in G, as desired. 


We return briefly to 2-connectivity. The next theorem can sometimes be used to 
provide the induction step in an induction proof. 


THEOREM 5.7.10 The Handle Theorem Suppose G is 2-connected and K is a 
2-connected proper subgraph of G. Then there are subgraphs L and H (the handle) of G 
such that D is 2-connected, L contains K, H is a simple path, LZ and H share exactly the 
endpoints of H, and G is the union of L and H. 


Proof. Given G and K, let L be a maximal proper subgraph of G containing K. If 
V(L) = V(G), let e be an edge not in L. Since L plus the edge e is 2-connected, it must 
be G, by the maximality of L. Hence H is the path consisting of e and its endpoints. 

Suppose that v is in V(G) but not V(L). Let u be a vertex of L. Since G is 2- 
connected, there is a cycle containing v and u. Following the cycle from v to u, Let w be 
the first vertex in L. Continuing on the cycle from wu to v, let x be the last vertex in L. Let 
P be the path continuing around the cycle: (x, v1, v2,..-, Uk, U = Vk-41; Uk+2;++-;Um, Ww). If 
x #w, let H = P. Since L together with H is 2-connected, it is G, as desired. 
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Ifx=w then z= w =u. Let y be a vertex of LZ other than u. Since G is 2-connected, 
there is a path P; from v to y that does not include u. Let v; be the last vertex on P; that 
is in {v1,...,U,.--,Um}; without loss of generality, suppose 7 > k+ 1. Then let H be the 
path (u,v1,...,U = Ur+i,---,Uj,---,y), where from uv; to y we follow path P,. Now LUH 
is a 2-connected subgraph of G, but it is not G, as it does not contain the edge {u, vm}, 
contradicting the maximality of L. Thus x 4 w. = 


A graph that is not connected consists of connected components. In a theorem rem- 
iniscent of this, we see that connected graphs that are not 2-connected are constructed 
from 2-connected subgraphs and bridges. 


DEFINITION 5.7.11 A block in a graph G is a maximal induced subgraph on at 
least two vertices without a cutpoint. 


As usual, maximal here means that the induced subgraph B cannot be made larger by 
adding vertices or edges (while retaining the desired property, in this case, no cutpoints). A 
block is either a 2-connected induced subgraph or a single edge together with its endpoints. 
Blocks are useful in large part because of this theorem: 


THEOREM 5.7.12 The blocks of G partition the edges. 


Proof. We need to show that every edge is in exactly one block. If an edge is in no 
2-connected induced subgraph of G, then, together with its endpoints, it is itself a block. 
Thus, every edge is in some block. 

Now suppose that B,; and Bz are distinct blocks. This implies that neither is a sub- 
graph of the other, by the maximality condition. Hence, the induced subgraph G[V(B,) U 
V(Bg)] is larger than either of B; and By. Suppose B; and Bo share an edge, so that they 
share the endpoints of this edge, say u and v. Supppose w is a vertex in V(B,) U V(B2). 
Since B, — w and By — w are connected, so is G[(V(B,) UV(B2))\{w}], because either 
u or v is in (V(B,) UV(Bo2))\{w}. Thus G[V(B,) U V(B2)] has no cutpoint but strictly 
contains B, and Bo, contradicting the maximality property of blocks. Thus, every edge is 
in at most one block. = 


If G has a single block, it is either Ky or is 2-connected, and any 2-connected graph 
has a single block. 


THEOREM 5.7.13 If G is connected but not 2-connected, then every vertex that is in 
two blocks is a cutpoint of G. 


Proof. Suppose w is in B, and Bo, but G — w is connected. Then there is a path 
V1, U2,--.,U, in G—w, with v; € By, and vy € Bo. But then G[V(B1) U V(Be) U 
{V1,V2,-.-,Uz}] has no cutpoint and contains both B, and Bs, a contradiction. = 
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Exercises 5.7. 


10. 


11. 


(n — 1)(n — 2) 


. Suppose a simple graph G on n > 2 vertices has at least + 1 edges. Prove 


2 
that G is connected. 


. Suppose a general graph G has exactly two odd-degree vertices, v and w. Let G’ be the 


graph created by adding an edge joining v to w. Prove that G’ is connected if and only if G 
is connected. 


. Suppose G is simple with degree sequence d, < dz <--- <dn, and fork < n—d,—1, dy > k. 


Show G is connected. 


. Recall that a graph is k-regular if all the vertices have degree k. What is the smallest integer 


k that makes this true: 
If G is simple, has n vertices, m > k, and G is m-regular, then G is connected. 


(Of course k depends on n.) 


. Suppose G has at least one edge. Show that G is 2-connected if and only if for all vertices uv 


and edges e there is a cycle containing v and e. 


. Find a simple graph with «(G) < »(G) < 6(G). 
. Suppose A(G) = k > 0. Show that there are sets of vertices U and V that partition the 


vertices of G, and such that there are exactly k edges with one endpoint in U and one 
endpoint in V. 


. Find A\(Km,n), where both m and n are at least 1. (Km,n is the complete bipartite graph on 


m and n vertices: the parts have m and n vertices, and every pair of vertices, one from each 
part, is connected by an edge.) 


. Suppose G is a connected graph. The block-cutpoint graph of G, BC(G) is formed as 


follows: Let vertices c1,c2,..., Cz be the cutpoints of G, and let the blocks of G be Bi,..., Bi. 
The vertices of BC(G) are ci,...,cx, Bi,..., Bi. Add an edge { B;,c;} if and only if c; € B;. 
Show that the block-cutpoint graph is a tree. 

Note that a cutpoint is contained in at least two blocks, so that all pendant vertices of 
the block-cutpoint graph are blocks. These blocks are called endblocks. 


Draw the block-cutpoint graph of the graph below. 


Show that the complement of a disconnected graph is connected. Is the complement of a 
connected graph always disconnected? (The complement G of graph G has the same vertices 
as G, and {v,w} is an edge of G if and only if it is not an edge of G.) 
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5.8 GRAPH COLORING 


As we briefly discussed in section 1.1, the most famous graph coloring problem is certainly 
the map coloring problem, proposed in the nineteenth century and finally solved in 1976. 


DEFINITION 5.8.1 A proper coloring of a graph is an assignment of colors to the 


vertices of the graph so that no two adjacent vertices have the same color. 


Usually we drop the word “proper” unless other types of coloring are also under dis- 
cussion. Of course, the “colors” don’t have to be actual colors; they can be any distinct 
labels—integers, for example. If a graph is not connected, each connected component can 
be colored independently; except where otherwise noted, we assume graphs are connected. 
We also assume graphs are simple in this section. 

Graph coloring has many applications in addition to its intrinsic interest. 


EXAMPLE 5.8.2 Ifthe vertices of a graph represent academic classes, and two vertices 
are adjacent if the corresponding classes have people in common, then a coloring of the 
vertices can be used to schedule class meetings. Here the colors would be schedule times, 
such as 8MWF, IMWF, 11TTh, etc. Oo 


EXAMPLE 5.8.3 _ If the vertices of a graph represent radio stations, and two vertices 
are adjacent if the stations are close enough to interfere with each other, a coloring can be 


used to assign non-interfering frequencies to the stations. 


EXAMPLE 5.8.4 If the vertices of a graph represent traffic signals at an intersection, 
and two vertices are adjacent if the corresponding signals cannot be green at the same 
time, a coloring can be used to designate sets of signals than can be green at the same 


time. 


Graph coloring is closely related to the concept of an independent set. 


DEFINITION 5.8.5 A set S of vertices in a graph is independent if no two vertices of 
S are adjacent. 


If a graph is properly colored, the vertices that are assigned a particular color form an 
independent set. Given a graph G it is easy to find a proper coloring: give every vertex a 
different color. Clearly the interesting quantity is the minimum number of colors required 
for a coloring. It is also easy to find independent sets: just pick vertices that are mutually 
non-adjacent. A single vertex set, for example, is independent, and usually finding larger 
independent sets is easy. The interesting quantity is the maximum size of an independent 
set. 
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DEFINITION 5.8.6 The chromatic number of a graph G is the minimum number 
of colors required in a proper coloring; it is denoted y(G). The independence number 


of G is the maximum size of an independent set; it is denoted a(G). 


The natural first question about these graphical parameters is: how small or large 
can they be in a graph G with n vertices. It is easy to see that 
1G) <n 
l<a(G)<n 
and that the limits are all attainable: A graph with no edges has chromatic number 1 and 
independence number n, while a complete graph has chromatic number n and independence 
number 1. These inequalities are thus not very interesting. We will see some that are more 
interesting. 
Another natural question: What is the relation between the chromatic number of a 
graph G and chromatic number of a subgraph of G? This too is simple, but quite useful 


at times. 


THEOREM 5.8.7 _ If H is a subgraph of G, x(H) < x(G). 


Proof. Any coloring of G provides a proper coloring of H, simply by assigning the same 
colors to vertices of H that they have in G. This means that H can be colored with x(G) 
colors, perhaps even fewer, which is exactly what we want. = 


Often this fact is interesting “in reverse”. For example, if G has a subgraph H that 
is a complete graph K,,, then x(H) = m and so x(G) > m. A subgraph of G that is a 
complete graph is called a clique, and there is an associated graphical parameter. 


DEFINITION 5.8.8 The clique number of a graph G is the largest m such that Ki, 
is a subgraph of G. 


It is tempting to speculate that the only way a graph G could require m colors is 
by having such a subgraph. This is false; graphs can have high chromatic number while 
having low clique number; see figure 5.8.1. It is easy to see that this graph has y > 3, 
because there are many 3-cliques in the graph. In general it can be difficult to show that 
a graph cannot be colored with a given number of colors, but in this case it is easy to see 
that the graph cannot in fact be colored with three colors, because so much is “forced”. 
Suppose the graph can be colored with 3 colors. Starting at the left if vertex v, gets color 
1, then vg and v3 must be colored 2 and 3, and vertex v4 must be color 1. Continuing, v1 
must be color 1, but this is not allowed, so y > 3. On the other hand, since vjg can be 
colored 4, we see x = 4. 

Paul Erd6és showed in 1959 that there are graphs with arbitrarily large chromatic 
number and arbitrarily large girth (the girth is the size of the smallest cycle in a graph). 
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UI V10 
U4 


U3 
Figure 5.8.1 <A graph with clique number 3 and chromatic number 4. 


This is much stronger than the existence of graphs with high chromatic number and low 
clique number. 

Bipartite graphs with at least one edge have chromatic number 2, since the two parts 
are each independent sets and can be colored with a single color. Conversely, if a graph can 
be 2-colored, it is bipartite, since all edges connect vertices of different colors. This means 
it is easy to identify bipartite graphs: Color any vertex with color 1; color its neighbors 
color 2; continuing in this way will or will not successfully color the whole graph with 
2 colors. If it fails, the graph cannot be 2-colored, since all choices for vertex colors are 
forced. 

If a graph is properly colored, then each color class (a color class is the set of all 
vertices of a single color) is an independent set. 


THEOREM 5.8.9 In any graph G on n vertices, e < Xx. 
a 


Proof. Suppose G is colored with y colors. Since each color class is independent, the 


size of any color class is at most a. Let the color classes be V;, V2,...,Vy. Then 
x 
n=) [Vil < xa, 
i=1 
as desired. 2 


We can improve the upper bound on x(G) as well. In any graph G, A(G) is the 


maximum degree of any vertex. 


THEOREM 5.8.10 In any graph G, y < A+1. 


Proof. We show that we can always color G with A+ 1 colors by a simple greedy 
algorithm: Pick a vertex v,, and list the vertices of G as v1, v2,...,Un so that if 7 < j, 
d(vi,Un) > d(vj, Un), that is, we list the vertices farthest from v, first. We use integers 
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1,2,...,4+1 as colors. Color v, with 1. Then for each v; in order, color v; with the 
smallest integer that does not violate the proper coloring requirement, that is, which is 
different than the colors already assigned to the neighbors of v;. For 7 <n, we claim that 
v; is colored with one of 1,2,...,A. 

This is certainly true for vy}. For 1 <i< n, v; has at least one neighbor that is not 
yet colored, namely, a vertex closer to v, on a shortest path from v, to v;. Thus, the 


neighbors of v; use at most A — 1 colors from the colors 1,2,...,A, leaving at least one 
color from this list available for v;. 

Once v1,...,Un—1 have been colored, all neighbors of v,, have been colored using the 
colors 1,2,...,A, so color A +1 may be used to color vn. = 


Note that if d(v,) < A, even v, may be colored with one of the colors 1,2,...,A. 
Since the choice of v, was arbitrary, we may choose v, so that d(v,) < A, unless all 
vertices have degree A, that is, if G is regular. Thus, we have proved somewhat more than 
stated, namely, that any graph G that is not regular has y < A. (If instead of choosing 
the particular order of v1,...,Un that we used we were to list them in arbitrary order, even 
vertices other than v, might require use of color A+ 1. This gives a slightly simpler proof 
of the stated theorem.) We state this as a corollary. 


COROLLARY 5.8.11 If G is not regular, y < A. = 


There are graphs for which y = A+ 1: any cycle of odd length has A = 2 and y = 8, 
and K, has A=n-—1 and xy = n. Of course, these are regular graphs. It turns out that 
these are the only examples, that is, if G is not an odd cycle or a complete graph, then 


x(G) < A(G). 


THEOREM 5.8.12 Brooks’s Theorem _ If G is a graph other than K,, or Con+1, 
¥ <A. a 


The greedy algorithm will not always color a graph with the smallest possible number 
of colors. Figure 5.8.2 shows a graph with chromatic number 3, but the greedy algorithm 
uses 4 colors if the vertices are ordered as shown. 

In general, it is difficult to compute (G), that is, it takes a large amount of compu- 
tation, but there is a simple algorithm for graph coloring that is not fast. Suppose that v 
and w are non-adjacent vertices in G. Denote by G+ {v,w} = G+e the graph formed by 
adding edge e = {v,w} to G. Denote by G/e the graph in which v and w are “identified” , 
that is, v and w are replaced by a single vertex x adjacent to all neighbors of v and w. 
(But note that we do not introduce multiple edges: if u is adjacent to both v and w in G, 
there will be a single edge from x to u in G/e.) 

Consider a proper coloring of G in which v and w are different colors; then this is a 
proper coloring of G+ as well. Also, any proper coloring of G+ e is a proper coloring of 
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U5 U2 U5 V2 


V4 U3 V4 U3 
3 2 2 3 


Figure 5.8.2 A greedy coloring on the left and best coloring on the right. 


G in which v and w have different colors. So a coloring of G+ e with the smallest possible 
number of colors is a best coloring of G in which v and w have different colors, that is, 
x(G + e) is the smallest number of colors needed to color G so that v and w have different 
colors. 

If G is properly colored and v and w have the same color, then this gives a proper 
coloring of G/e, by coloring x in G/e with the same color used for v and w in G. Also, if 
G/e is properly colored, this gives a proper coloring of G in which v and w have the same 
color, namely, the color of x in G/e. Thus, x(G/e) is the smallest number of colors needed 
to properly color G so that v and w are the same color. 

The upshot of these observations is that x(G) = min(yv(G+e), y(G/e)). This algorithm 
can be applied recursively, that is, if G; = G+e and Gz = G/e then x(G1) = min(y(Gi + 
e),x(Gi/e)) and x(G2) = min(x(G2 +e), x(G2/e)), where of course the edge e is different 
in each graph. Continuing in this way, we can eventually compute y(G), provided that 
eventually we end up with graphs that are “simple” to color. Roughly speaking, because 
G/e has fewer vertices, and G + e has more edges, we must eventually end up with a 
complete graph along all branches of the computation. Whenever we encounter a complete 
graph K,, it has chromatic number m, so no further computation is required along the 
corresponding branch. Let’s make this more precise. 


THEOREM 5.8.13 The algorithm above correctly computes the chromatic number in 


a finite amount of time. 


Proof. Suppose that a graph G has n vertices and m edges. The number of pairs of 
2 


If na(G) = 0 then G is a complete graph and the algorithm terminates immediately. 
Now we note that na(G + e) < na(G) and na(G/e) < na(G): 


non-adjacent vertices is na(G) = ( ) —m. The proof is by induction on na. 


na(G +e) = (5 


) —(m+1) =na(G)-1 
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and 


na(G/e) = ("5") ~ (m6), 


where c is the number of neighbors that v and w have in common. Then 
-—1 
na(G/e) = v ) =e 


—1 
as G )-m4n-2 


= 5 —-m+n-—2 
_nn-1) 2(n—1) 

-_ 5 5 mtn—2 
-()J-m- 

= na(G) — 1. 


Now if na(G) > 0, G is not a complete graph, so there are non-adjacent vertices v and 
w. By the induction hypothesis the algorithm computes x(G + e) and x(G/e) correctly, 
and finally it computes x(G) from these in one additional step. = 


While this algorithm is very inefficient, it is sufficiently fast to be used on small graphs 
with the aid of a computer. 


EXAMPLE 5.8.14 We illustrate with a very simple graph: 


A | x] 
3 4 
The chromatic number of the graph at the top is min(3, 4) = 3. (Of course, this is quite 


easy to see directly.) 


Exercises 5.8. 


k 
= 


2. Find the chromatic number of the graph below by using the algorithm in this section. Draw 
all of the graphs G + e and G/e generated by the alorithm in a “tree structure” with the 
complete graphs at the bottom, label each complete graph with its chromatic number, then 
propogate the values up to the original graph. 


1. Suppose G has n vertices and chromatic number k. Prove that G has at least ( ) edges. 
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3. Show that x(G — v) is either y(G) or x(G) — 1. 
4. Prove theorem 5.8.10 without assuming any particular properties of the order v1,..., Un. 


5. Prove theorem 5.8.12 as follows. By corollary 5.8.11 we need consider only regular graphs. 
Regular graphs of degree 2 are easy, so we consider only regular graphs of degree at least 3. 


If G is not 2-connected, show that the blocks of G may colored with A(G) colors, and 
then the colorings may be altered slightly so that they combine to give a proper coloring of 


G. 


If G is 2-connected, show that there are vertices u, v, w such that u is adjacent to both 
v and w, v and w are not adjacent, and G — v — w is connected. Given such vertices, color v 
and w with color 1, then color the remaining vertices by a greedy algorithm similar to that 
in theorem 5.8.10, with wu playing the role of un. 


To show the existence of u, v, w as required, let x be a vertex not adjacent to all other 
vertices. If G — x is 2-connected, let v = a, let w be at distance 2 from v (justify this), and 
let a path of length 2 be v,u,w. Use theorem 5.7.4 to show that u, v, w have the required 
properties. 


If G — x is not 2-connected, let wu = x and let v and w be (carefully chosen) vertices in 
two different endblocks of G — x. Show that u, v, w have the required properties. 


Brooks proved the theorem in 1941; this simpler proof is due to Lovadsz, 1975. 


5.9 THE CHROMATIC POLYNOMIAL 


We now turn to the number of ways to color a graph G with k colors. Of course, if 
k < x(G), this is zero. We seek a function Pg(k) giving the number of ways to color G 
with k colors. Some graphs are easy to do directly. 


EXAMPLE 5.9.1 If Gis Kn, Po(k) = k(k —1)(k — 2)---(K —n +1), namely, the 
number of permutations of k things taken n at a time. Vertex 1 may be colored any of the 


k colors, vertex 2 any of the remaining k — 1 colors, and so on. Note that when k < n, 
n 


Pg(k) = 0. By exercise 5 in section 1.8, we may also write Po(k) = S- s(n,i)k*. 
i=0 


EXAMPLE 5.9.2 If G has n vertices and no edges, Pg(k) = k”. 


Given Pg it is not hard to compute x(G); for example, we could simply plug in the 
numbers 1,2,3,... for & until P¢(k) is non-zero. This suggests it will be difficult (that is, 
time consuming) to compute Pg. We can provide an easy mechanical procedure for the 
computation, quite similar to the algorithm we presented for computing y(G). 
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Suppose G has edge e = {v,w}, and consider Pg_.(k), the number of ways to color 
G —e with k colors. Some of the colorings of G — e are also colorings of G, but some are 
not, namely, those in which v and w have the same color. How many of these are there? 
From our discussion of the algorithm for x(G) we know this is the number of colorings of 
G/e. Thus, 
Pe(k) = Pe—-e(k) — Paye(k). 


Since G — e and G/e both have fewer edges than G, we can compute Pg by applying this 
formula recursively. Ultimately, we need only compute Pg for graphs with no edges, which 
is easy, as in example 5.9.2. 

Since Po(k) = k” when G has no edges, it is then easy to see, and to prove by 
induction, that Pg is a polynomial. 


THEOREM 5.9.3 For all G on n vertices, Pg is a polynomial of degree n, and Pg is 
called the chromatic polynomial of G. 


Proof. ‘The proof is by induction on the number of edges in G. When G has no edges, 
this is example 5.9.2. 
Otherwise, by the induction hypothesis, Pg—e¢ is a polynomial of degree n and Pg/- is 
a polynomial of degree n — 1, so Pg = Pg—e — Pa@/e is a polynomial of degree n. a 
The chromatic polynomial of a graph has a number of interesting and useful properties, 
some of which are explored in the exercises. 


Exercises 5.9. 


1. Show that the leading coefficient of Pg is 1. 


2. Suppose that G is not connected and has components C1,...,Cx. Show that Pg = nan Po;. 

3. Show that the constant term of Pg(k) is 0. Show that the coefficient of k in Pe(k) is non-zero 
if and only if G is connected. 

4. Show that the coefficient of k”~' in Pg is —1 times the number of edges in G. 

5. Show that G is a tree if and only if Pg(k) = k(k—1)""". 

6. Find the chromatic polynomial of K, with one edge removed. 

7. Find the chromatic polynomial of the cycle C,, n > 3. 


5.10 COLORING PLANAR GRAPHS 


Now we return to the original graph coloring problem: coloring maps. As indicated in 
section 1.1, the map coloring problem can be turned into a graph coloring problem. Fig- 
ure 5.10.1 shows the example from section 1.1. 

Graphs formed from maps in this way have an important property: they are planar. 
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Figure 5.10.1. <A map and its corresponding graph. 


DEFINITION 5.10.1 A graph G is planar if it can be represented by a drawing in the 
plane so that no edges cross. 


Note that this definition only requires that some representation of the graph has no 
crossing edges. Figure 5.10.2 shows two representations of K4; since in the second no edges 
cross, Ky is planar. 


Figure 5.10.2 K4 drawn in two ways; the second shows that it is planar. 


The number of colors needed to properly color any map is now the number of colors 
needed to color any planar graph. This problem was first posed in the nineteenth century, 
and it was quickly conjectured that in all cases four colors suffice. This was finally proved 
in 1976 (see figure 5.10.3) with the aid of a computer. In 1879, Alfred Kempe gave a 
proof that was widely known, but was incorrect, though it was not until 1890 that this 
was noticed by Percy Heawood, who modified the proof to show that five colors suffice to 
color any planar graph. We will prove this Five Color Theorem, but first we need some 
other results. We assume all graphs are simple. 


THEOREM 5.10.2 Euler’s Formula Suppose G is a connected planar graph, drawn 
so that no edges cross, with n vertices and m edges, and that the graph divides the plane 
into r regions. Then 

r=m—-—n+2. 
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Figure 5.10.3. The postmark on University of Illinois mail after the Four Color Theorem 
was proved. 


Proof. The proof is by induction on the number of edges. The base case is m = n — 1, 
the minimum number of edges in a connected graph on n vertices. In this case G is a tree, 
and contains no cycles, so the number of regions is 1, and indeed 1 = (n—1) —n+2. 
Now suppose G has more than n — 1 edges, so it has a cycle. Remove one edge from 
a cycle forming G’, which is connected and has r — 1 regions, n vertices, and m — 1 edges. 
By the induction hypothesis r — 1 = (m — 1) —n+ 2, which becomes r = m —n+2 when 
we add 1 to each side. = 


LEMMA 5.10.3 Suppose G is a simple connected planar graph, drawn so that no edges 
cross, with n > 3 vertices and m edges, and that the graph divides the plane into r regions. 
Then m < 3n —6. 


Proof. Let f; be the number of edges that adjoin region number 7; if the same region is 
on both sides of an edge, that edge is counted twice. We call the edges adjoining a region 
the boundary edges of the region. Since G is simple and n > 3, every region is bounded 
by at least 3 edges. Then }*\_, f; = 2m, since each edge is counted twice, once for the 
region on each side of the edge. From r = m—n+2 we get 3r = 3m—3n+6, and because 
pa eS ee fi = 2m, so 3m — 8n + 6 < 2m, or m < 3n — 6 as desired. | 


THEOREM 5.10.4 _ Ks is not planar. 

Proof. Ks has 5 vertices and 10 edges, and 10 < 3-5 —6, so by the lemma, Ks is not 
planar. = 
LEMMA 5.10.5 If G is planar then G has a vertex of degree at most 5. 


Proof. We may assume that G is connected (if not, work with a connected component 
of G). Suppose that d(v;) > 5 for all v;. Then 2m = )>"_, d(v;) > 6n. By lemma 5.10.3, 
3n —6 >m so 6n — 12 > 2m. Thus 6n < 2m < 6n — 12, a contradiction. | 


THEOREM 5.10.6 Five Color Theorem — Every planar graph can be colored with 
5 colors. 


Proof. The proof is by induction on the number of vertices n; when n < 5 this is trivial. 
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Now suppose G is planar on more than 5 vertices; by lemma 5.10.5 some vertex v has 
degree at most 5. By the induction hypothesis, G — v can be colored with 5 colors. Color 
the vertices of G, other than v, as they are colored in a 5-coloring of G — v. If d(v) < 4, 
then v can be colored with one of the 5 colors to give a proper coloring of G with 5 colors. 
So we now suppose d(v) = 5. If the five neighbors of v are colored with four or fewer of 
the colors, then again v can be colored to give a proper coloring of G with 5 colors. 

Now we suppose that all five neighbors of v have a different color, as indicated in 
figure 5.10.4. 


V4 U3 
4 3 


Figure 5.10.4 Five neighbors of v colored with 5 colors: v; is red, v2 is purple, vg is green, 
va is blue, vs is orange. 


Suppose that in G there is a path from v; to v3, and that the vertices along this path 
are alternately colored red and green; call such a path a red-green alternating path. Then 
together with v, this path makes a cycle with vg on the inside and v4 on the outside, 
or vice versa. This means there cannot be a purple-blue alternating path from v2 to v4. 
Supposing that v2 is inside the cycle, we change the colors of all vertices inside the cycle 
colored purple to blue, and all blue vertices are recolored purple. This is still a proper 
coloring of all vertices of G except v, and now no neighbor of v is purple, so by coloring v 
purple we obtain a proper coloring of G. 

If there is no red-green alternating path from v, to v3, then we recolor vertices as 
follows: Change the color of vy to green. Change all green neighbors of v; to red. Continue 
to change the colors of vertices from red to green or green to red until there are no conflicts, 
that is, until a new proper coloring is obtained. Because there is no red-green alternating 
path from vz to v3, the color of v3 will not change. Now no neighbor of v is colored red, 


so by coloring v red we obtain a proper coloring of G. a 


Exercises 5.10. 


1. Show k3.3 is not planar. (Prove a lemma like lemma 5.10.3 for bipartite graphs, then do 
something like the proof of theorem 5.10.4.) What is the chromatic number of K3,3? 
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5.11 DIRECTED GRAPHS 


A directed graph, also called a digraph, is a graph in which the edges have a direction. 
This is usually indicated with an arrow on the edge; more formally, if v and w are vertices, 
an edge is an unordered pair {v,w}, while a directed edge, called an arc, is an ordered 
pair (v, w) or (w,v). The arc (v, w) is drawn as an arrow from v to w. If a graph contains 
both arcs (v,w) and (w,v), this is not a “multiple edge”, as the arcs are distinct. It is 
possible to have multiple arcs, namely, an arc (v,w) may be included multiple times in 
the multiset of arcs. As before, a digraph is called simple if there are no loops or multiple 
arcs. 

We denote by BE. the set of all arcs of the form (w,v), and by Ej the set of arcs of 
the form (v,w). The indegree of v, denoted d (v), is the number of arcs in FE, , and 
the outdegree, at (v), is the number of arcs in Et. If the vertices are v1, v2,...,Un, the 
degrees are usually denoted d; ,dz,...,d;, and dj,d3,...,d%. Note that both )77_, d, 


diid. 54 dy count the number of arcs exactly once, and of course )7;9d, = )oj_od, . 


A walk in a digraph is a sequence v1, €1, V2, €2,---,Uk—1; €k—1, Uk Such that e, = (vj, vi41)3 
if vy = vg, it is a closed walk or a circuit. A path in a digraph is a walk in which all 
vertices are distinct. It is not hard to show that, as for graphs, if there is a walk from v to 
w then there is a path from v to w. 

Many of the topics we have considered for graphs have analogues in digraphs, but 
there are many new topics as well. We will look at one particularly important result in 
the latter category. 


DEFINITION 5.11.1 A network is a digraph with a designated source s and target 
t#s. In addition, each arc e has a positive capacity, c(e). 


Networks can be used to model transport through a physical network, of a physical 
quantity like oil or electricity, or of something more abstract, like information. 


DEFINITION 5.11.2 A flow in a network is a function f from the arcs of the digraph 
to R, with 0 < f(e) < c(e) for all e, and such that 


» fe)= dS fe), 


eCEt e€E,, 


for all v other than s and t. 


We wish to assign a value to a flow, equal to the net flow out of the source. Since 
the substance being transported cannot “collect” or “originate” at any vertex other than 
s and t, it seems reasonable that this value should also be the net flow into the target. 
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Before we prove this, we introduce some new notation. Suppose that U is a set of 
vertices in a network, with s € U and t ¢ U. Let U be the set of arcs (v,w) with v € U, 
w ¢U, and U be the set of arcs (v,w) with v € U, w € Uz 


THEOREM 5.11.3 For any flow f in a network, the net flow out of the source is equal 


to the net flow into the target, namely, 


YIO=)  f0= > 70S fe: 


ec Et eCE, eC€E, ec Ey 


Proof. We will show first that for any U with s € U andt ¢ U, 
SS fle- S| fe) = > FO- YS Fle). 
e€EL eck, Pi sel 


Consider the following: 


S=S° | So fO- YS fe) 


vEU ecEt eck, 


The quantity 


S> He) - SZ FO 


e€ Et eCE, 

is zero except when v = s, by the definition of a flow. Thus, the entire sum S' has value 
SFO] doa): 
e€ Ey eC Ey 

On the other hand, we can write the sum S as 


SOD Te 


veU eeRt veU ceEeE, 


Every arc e = (x,y) with both x and y in U appears in both sums, that is, in 


yD) 


veEU e€ Et 


when v = 2, and in 


ye Utes 


veU ec ED 


when v = y, and so the flow in such arcs contributes 0 to the overall value. Thus, only arcs 
with exactly one endpoint in U make a non-zero contribution, so the entire sum reduces 
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to 
fos) FO) 
ecU ecU 
Thus 
>» fO- dU f@=S= db) fO- Dd fe), 
ecES eck, ig sell 
as desired. 


Now let U consist of all vertices except t. Then 


S16 =)>, 10= dX fe) - d_ Fe) = 110 = >) FO) 


+ - - + 
e€E3 eC€k, ecU ec U e€E, e€E, 


finishing the proof. = 


DEFINITION 5.11.4 The value of a flow, denoted val(f), is Pec e+ fle)—Lecr- fle). 


A maximum flow in a network is any flow f whose value is the maximum among all 


flows. 


We next seek to formalize the notion of a “bottleneck”, with the goal of showing that 
the maximum flow is equal to the amount that can pass through the smallest bottleneck. 


DEFINITION 5.11.5 A cut in a network is a set C' of arcs with the property that 
every path from s to t uses an arc in C, that is, if the arcs in C’ are removed from the 
network there is no path from s to t. The capacity of a cut, denoted c(C), is 


S- c(e). 


e€C 


A minimum cut is one with minimum capacity. A cut Cis minimal if no cut is properly 


contained in C’. 


Note that a minimum cut is a minimal cut. Clearly, if U is a set of vertices containing 
s but not t, then U is a cut. 


LEMMA 5.11.6 Suppose C is a minimal cut. Then there is a set U containing s but 
not t such that C = U. 


Proof. Let U be the set of vertices v such that there is a path from s to v using no arc 
in C. 

Suppose that e = (v,w) € C. Since C is minimal, there is a path P from s to t using 
e but no other arc in C’. Thus, there is a path from s to v using no arc of C, so v € U. If 
there is a path from s to w using no arc of C’, then this path followed by the portion of P 
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that begins with w is a walk from s to t using no arc in C. This implies there is a path 
from s to t using no arc in C, a contradiction. Thus w ¢ U and so e € U. Hence, C' C ur. 
Suppose that e = (v,w) € U. Then v € U and w ¢ U, so every path from s to w uses 
an arc in C’.. Since v € U, there is a path from s to v using no arc of C, and this path 
followed by e is a path from s to w. Hence the arc e must be in C, so U CC, 
We have now shown that C = U. = 


Now we can prove a version of the important max-flow, min cut theorem. 


THEOREM 5.11.7 Suppose in a network all arc capacities are integers. Then the 
value of a maximum flow is equal to the capacity of a minimum cut. Moreover, there is a 


maximum flow f for which all f(e) are integers. 


Proof. First we show that for any flow f and cut C, val(f) < c(C). It suffices to show 
this for a minimum cut C’, and by lemma 5.11.6 we know that C= U for some U. Using 
the proof of theorem 5.11.3 we have: 


val(f)= > #e)- VY FOS YO fs Y ele) =e). 
Pe i Sig au eu 


Now if we find a flow f and cut C with val(f) = c(C), it follows that f is a maximum flow 
and C' is a minimum cut. We present an algorithm that will produce such an f and C. 
Given a flow f, which may initially be the zero flow, f(e) = 0 for all arcs e, do the 
following: 
0. Let U = {s}. 
Repeat the next two steps until no new vertices are added to U. 
1. If there is an arc e = (v,w) with v € U and w ¢ U, and f(e) < c(e), add w to U. 
2. If there is an arc e = (v,w) with v ¢ U and w € U, and f(e) > 0, add v to U. 
When this terminates, either t € U or t € U. If t € U, there is a sequence of distinct 
vertices 8 = U1, V2,U3,--.,Uk =t such that for each i, 1 <i < k, either e = (vj, vi41) is an 
arc with f(e) < c(e) or e = (u;41, 4%) is an arc with f(e) > 0. Update the flow by adding 
1 to f(e) for each of the former, and subtracting 1 from f(e) for each of the latter. This 
new flow f’ is still a flow: In the first case, since f(e) < c(e), f’(e) < c(e), and in the 
second case, since f(e) > 0, f’(e) > 0. It is straightforward to check that for each vertex 


uj, 1<1< k, that 
SX le= SS 
ec EY, e€Ey, 


In addition, val(f’) = val(f) +1. Now rename f’ to f and repeat the algorithm. 
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Eventually, the algorithm terminates with t ¢ U and flow f. This implies that for 
each e = (v,w) with v € U and w ¢ U, f(e) = c(e), and for each e = (v,w) with v ¢ U 
and w € U, f(e) =0. The capacity of the cut U is 


>, c(e). 
ee U 
The value of the flow f is 


dF) = d, Fe) = de) - Dai de). 


ecU eeU ecU eeU ee U 
Thus we have found a flow f and cut U such that 
val(f) = e(U), 
as desired. = 


The max-flow, min-cut theorem is true when the capacities are any positive real num- 
bers, though of course the maximum value of a flow will not necessarily be an integer in 
this case. It is somewhat more difficult to prove; a proof involves limits. 

We have already proved that in a bipartite graph, the size of a maximum matching is 
equal to the size of a minimum vertex cover, theorem 4.5.6. This turns out to be essentially 
a special case of the max-flow, min-cut theorem. 


COROLLARY 5.11.8 Ina bipartite graph G, the size of a maximum matching is the 


same as the size of a minimum vertex cover. 


Proof. Suppose the parts of G are X = {x1,%2,...,¢~} and Y = {y1, y2,..., yi}. Create 
a network as follows: introduce two new vertices s and ¢ and arcs (s,x;) for all 7 and (yj, t) 
for all i. For each edge {2;,y;} in G, let (a;,y;) be an arc. Let c(e) = 1 for all arcs e. Now 
the value of a maximum flow is equal to the capacity of a minimum cut. 

Let C be a minimum cut. If (x;,y;) is an arc of C, replace it by arc (s,x;). This is 
still a cut, since any path from s to t including (x;, y;) must include (s, x;). Thus, we may 
suppose that C' contains only arcs of the form (s,x;) and (y;,t). Now it is easy to see that 


K = {ail(s, vi) € CHU {yal yas t) € Ch 


is a vertex cover of G with the same size as C’. 
Let f be a maximum flow such that f(e) is an integer for all e, and val(f) = c(C), 
which is possible by the max-flow, min-cut theorem. Consider the set of edges 


M = (tai, ug tf (ta, 93) = Ue 


If {x;,y;} and {x;,ym} are both in this set, then the flow out of vertex x; is at least 2, 
but there is only one arc into 2;, (s,x2;), with capacity 1, contradicting the definition of a 
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flow. Likewise, if {v;,y;} and {v,y;} are both in this set, then the flow into vertex y, is 
at least 2, but there is only one arc out of y;, (y;,t), with capacity 1, also a contradiction. 


Thus M is a matching. Moreover, if U = {s,x1,...,2,} then the value of the flow is 
S| fe)- > fe) = S> fle) =|M|-1=|M]. 
eeU eeU eceU 


Thus |M| = val(f) = c(C) = |K|, so we have found a matching and a vertex cover with 
the same size. This implies that M is a maximum matching and K is a minimum vertex 


cover. Py 


Exercises 5.11. 


1. Connectivity in digraphs turns out to be a little more complicated than connectivity in 
graphs. A digraph is connected if the underlying graph is connected. (The underlying graph 
of a digraph is produced by removing the orientation of the arcs to produce edges, that is, 
replacing each arc (v,w) by an edge {v,w}. Even if the digraph is simple, the underlying 
graph may have multiple edges.) A digraph is strongly connected if for every vertices v and 
w there is a walk from v to w. Give an example of a digraph that is connected but not 
strongly connected. 


2. A digraph has an Euler circuit if there is a closed walk that uses every arc exactly once. Show 
that a digraph with no vertices of degree 0 has an Euler circuit if and only if it is connected 


and d’(v) =d_ (v) for all vertices v. 


3. A tournament is an oriented complete graph. That is, it is a digraph on n vertices, containing 
exactly one of the arcs (v, w) and (w,v) for every pair of vertices. A Hamilton path is a walk 
that uses every vertex exactly once. Show that every tournament has a Hamilton path. 


4. Interpret a tournament as follows: the vertices are players. If (v,w) is an arc, player v beat 
w. Say that v is a champion if for every other player w, either v beat w or v beat a player 
who beat w. Show that a player with the maximum number of wins is a champion. Find a 
5-vertex tournament in which every player is a champion. 


6 


Poélya—Redfield Counting 


We have talked about the number of ways to properly color a graph with k colors, given 
by the chromatic polynomial. For example, the chromatic polynomial for the graph in 
figure 6.0.1 is k+ — 4k? + 6k? — 3k, and f(2) = 2. The two colorings are shown in the 
figure, but in an obvious sense they are the same coloring, since one can be turned into the 
other by simply rotating the graph. We will consider a slightly different sort of coloring 
problem, in which we count the “truly different” colorings of objects. 


Figure 6.0.1 Cy, drawn as a square, colored in two ways. 


Many of the “objects” we color will appear to be graphs, but we will usually be 
interested in them as geometric objects rather than graphs, and we will not require that 
adjacent vertices be different colors. This will simplify the problems; counting the number 
of different proper colorings of graphs can also be done, but it is more complicated. 

So consider this problem: How many different ways are there to color the vertices of 
a regular pentagon with k colors? The number of ways to color the vertices of a fixed 
pentagon is k°, but this includes many duplicates, that is, colorings that are really the 
same. But what do we mean by “the same” in this case? We might mean that two 
colorings are the same if we can rotate one to get the other. But what about the colorings 
in figure 6.0.2? Are they the same? Neither can be rotated to produce the other, but either 
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can be flipped or reflected through a vertical line to produce the other. In fact we are free 
to decide what “the same” means, but we will need to make sure that our requirements 


are consistent. 


Figure 6.0.2 Two colorings of a pentagon. 


As an example of what can go wrong if we’re not careful, note that there are five lines 
through which the pentagon can be reflected onto itself. Suppose we want to consider 
colorings to be “the same” if one can be produced from the other by a reflection, but not 
if one can be obtained from the other by rotation. Surely if one coloring can be obtained 
by two reflections in a row from another, then these colorings should also be the same. 
But two reflections in a row equal a rotation, as shown in figure 6.0.3. So whenever we 
have some motions that identify two colorings, we are required to include all combinations 
of the motions as well. In addition, any time we include a motion, we must include the 
“inverse” motion. For example, if we say a rotation by 72° degrees produces a coloring 
that we consider to be the same, a rotation by —72° must be included as well (we may 
think of this as a rotation by 288°). Finally, since any coloring is the same as itself, we 
must always include the “trivial” motion, namely, doing nothing, or rotation by 0° if you 
prefer. 


Figure 6.0.3 Two reflections equal a rotation. 
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6.1 GROUPS OF SYMMETRIES 


The motions we want to consider can be thought of as permutations, that is, as bijections. 
For example, the rotation in figure 6.1.1 can be thought of as the function ¢ given by 


1 4 


Cy cy 
4 3 2 1 


Figure 6.1.1. The rotation ( 


As we would hope, doing two motions in a row corresponds to the compostion of the as- 


23 4 844 ' 
309 1 «5 1) is shown in figure 6.1.2, 


Doing first the rotation of figure 6.1.1 and then this reflection is shown in figure 6.1.3, and 


sociated functions. For example, the reflection 


this does indeed correspond to 


: F 1 2 3 4 5 
Figure 6.1.2. The reflection (| 2 125 ) 
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1 4 1 
cy ey cy 
4 3 2 1 3 4 
, : 12 3 4 5 123 4 5\) /f12 3 4 5 
Figure 6.1.3 The compostion ( j 2 15 aC 3 456 1 eG 5 4 3 ): 


With some restrictions, we may choose any permutations of the vertices as the allow- 
able rearrangements giving colorings that are the same. We have discussed the restrictions 
in general terms; in terms of permuations we require the following: Suppose that G is a 
set of permutations that we wish to use to define the “same coloring” relation. Then the 
following must be true: 

1. If ¢ and o are in G, so is dog. 


2. The identity permutation, id, is in G. 
3. If 6€G,¢' €G. 


DEFINITION 6.1.1 If G has the three properties above it is called a group of per- 


mutations. 


EXAMPLE 6.1.2 The group of all permutations of {1,2,...,n} is denoted S,,, the 
symmetric group on n items. It satisfies the three required conditions by simple prop- 


erties of bijections. 


In the case of the regular pentagon, there are a number of groups of permutations, 
but two are of primary interest. The five possible rotations (including the trivial rotation) 
form a group, the cyclic group of size 5. The total number of “rigid motions”, that 
is, any combination of rotations and reflections that leave the pentagon superimposed on 
itself, is 10: Once the position of vertex 1 is established, the other vertices can increase 
from 1 either clockwise or counterclockwise. The rotations provide all of the former, and 
it is easy to check that the five reflections provide the counterclockwise positions. This is 
called a dihedral group and denoted Ds. 

Suppose that G is some group of permutations of an object. If 6 € G, then ¢ induces 
a function on the colorings of the object in a natural way, and we can use the same symbol 
@ to represent this function without confusion. If c is a coloring of the object, then ¢(c) is 
the coloring that results by applying @¢ to the colored object, moving the colors with the 
object. See figure 6.1.4 for examples. We say that G acts on the set of colorings C’. 
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Li | 


Figure 6.1.4 Some examples of an induced function on colorings. 


If we apply all permutations in G to a coloring c, we get all the colorings that we 
consider to be the same as c modulo G. More formally, define c; ~ cg if there isa gE G 
such that ¢(c1) = c2; ~ is an equivalence relation on the colorings. The equivalence classes, 
called orbits in this context, group colorings that are the same together. The number of 
truly different colorings that we want to count is then the number of orbits. 

The total number of colorings of the pentagon with k colors is k°. If all orbits were 
the same size, say s, then the number of orbits would be k°/s. Unfortunately, this is not 
true. In figure 6.1.4 we see a coloring whose orbit has size at least 3, but the pentagon 
with all vertices colored red has orbit size 1. 


Exercises 6.1. 


1. Find the 12 permutations of the vertices of the regular tetrahedron corresponding to the 12 
rigid motions of the regular tetrahedron. Use the labeling below. 
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/ 
/ 53 = 
b = --+A+-- tee 


ae _— 


3 
2. Find the 12 permutations of the edges of the regular tetrahedron corresponding to the 12 
rigid motions of the regular tetrahedron. Use the labeling below. 


f 
{ s 
/ \ 
f X 
/ 
? \@a 
Cc \ 
j . 
f b ‘ 
/ \ 
/ D. 
/ Ns 
/ 7, _peneeew 
5 "ie |i wee 
ei d 


6.2 BURNSIDE’S THEOREM 


Burnside’s Theorem will allow us to count the orbits, that is, the different colorings, in a 
variety of problems. We first need some lemmas. 

If cis a coloring, |c] is the orbit of c, that is, the equivalence class of c. Let G(c) be the 
set of permutations in G that fix c, that is, those ¢ such that ¢(c) = c; the permutation 
in figure 6.1.4 fixes the coloring in the bottom row, for example. 


LEMMA 6.2.1 Gc) is a group of permutations. 


Proof. We check the properties of a group from definition 6.1.1. 
Suppose ¢ and a both fix c; then ¢(o(c)) = ¢(c) = c, so doa fixes cand doa € G(c). 
The identity permutation fixes all colorings, so id € G(c). 
If ¢(c) =c then d—1(c) = ¢—1(¢(c)) = id(c) =c, so ¢-! € G(c). 7 


LEMMA 6.2.2  |G| = |[c]| - |G(c)]. 
Proof. For ¢ and a in G, define ¢ ~ a if c~'o¢d € G(c). This is an equivalence relation: 
1. o 10a is the identity function, which is in G(c). Thus o ~ a, so the relation is 
reflexive. 
2. Ifd6~oa,a0-10¢€ Gc). Then (o-'0¢)71 € G(o), and (o 10¢) 1!=¢ loceE 
G(c), so 0 ~ @ and ~ is symmetric. 
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3. If6 ~ ao ando ~7, thena !o¢ € G(c) and t~!oa € G(c), so (T7100) 0(a~1o¢) € 
G(c). Since (r~1 0g) 0 (07-1046) =7T 106, O~7, and ~ is transitive. 

Now we claim that the equivalence class of ¢ is A = {p00 | o € G(c)}. First, suppose 
that o € G(c); then 6 !ofdoa =a € Gc), 80 600 ~ gand A C [¢]. Next, suppose 
@~T,s0T todg=y7E€ G(c). Then doy! =7, so7 € A and [¢] C A. 

Now we show that each equivalence class is the same size as G(c). Define f:G(c) > 


{poa|a€ Gc} by f(y) =¢07. If fim) = f(92), then 


go%7 = $972 
dp 'ogoy=gh iodo 
yA = G2 


so f is 1-1. Since every doy € {h00 | 0 € Gi(c)} is f(y), f is onto. 

Thus the number of equivalence classes is |G'|/|G(c)|. 

Finally, we show that the number of equivalence classes is |{c]|. Let the set of equiva- 
lence classes in G be E, that is, E = {[¢] | 6 € G}. We define g:|c] > E and show that g 
is a bijection. Suppose d € [c], so d = o(c) for some a € G. Let g(d) = |c]. 

First, we show g is well-defined. If d = o1(c) = o2(c), then a7 ' 001(c) = €, 80 01 ~ 02 
and |o1| = [09], that is, g(oi(c)) = g(aa(c)). 

Next, suppose g(di) = g(d2). This means that d; = 01(c), dz = o2(c), and [a4] = [a9]. 
Hence a3! 0 01(c) =, so o4(c) = o2(c) and thus d; = dy, so g is 1-1. 

Suppose that [a] € &. Then g(o(c)) = [co], so g is onto. 

Thus we have 


exjgaies, NG 
ll =121= oy 
and 
IG(©)| - Ile] = 1G! 
as desired. a 


COROLLARY 6.2.3 Ifc~d then |G(c)| = |G(d)|. 
Proof. Since c ~ d, [{c] = |d], and 


Ce 


DEFINITION 6.2.4 If group G acts on the colorings of an object and o € G, fix(c) is 
the set of colorings that are fixed by a. 
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THEOREM 6.2.5 Burnside’s Theorem _ If group G acts on the colorings of an 
object, the number of distinct colorings modulo G is 


a S > | fix(o)|. 


o€G 


Proof. Let C be the set of all colorings, and let O be the set of orbits. Let c,,c2,..., Cx 


be a list of colorings, one in each orbit, so that the orbits are [cz], [co],..., [cx]. Consider 
the sum S- |G(c)|: 
cEC 
k 
WIEO => YS EO 
ceC i=1 c€ [es] 
k 
Ta ys, » IG(ci)| 
i=1 ce[ei] 
t=1 cE[c;] Ile | 


eal 


k 
IG] = |G| $01 = |G|/O}. 


t=1 i=1 


1 
l= @LIGOL 


cEeC 


Then 


This already gives an interesting formula for |O], but it is unwieldy, since the number of 
colorings is typically quite large; indeed, since we typically want to compute the number 
of orbits for k colors, the number of colorings is not a fixed number. 

With just a little more work we can fix this problem: 


~lEOl=d) do 1 


cEC c€C cEG(c) 


“ryt 


o€G cE€fix(c) 


= 37 | fix(o)). 


o€G 


1 
Ol= aq DH Axo) 


o€G 


Now 


as desired. = 
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Since the group of permutations in a typical problem is fairly small, the sum in Burn- 
side’s Theorem is usually manageable. Moreover, we can make the task of computing 
| fix(o)| fairly straightforward. Let’s consider a particular example, the permutation of 
figure 6.1.4, shown again in figure 6.2.1. If we are using k colors, how many colorings of 
the pentagon are fixed by this permutation? Since the permutation swaps vertices 2 and 
5, they must be the same color if ¢ is to fix the coloring. Likewise vertices 3 and 4 must 
be the same color; vertex 1 can be any color. Thus, the number of colorings fixed by ¢ is 
k®. It is easy to see that every “flip” permutation of the pentagon is essentially the same, 
so for each of the five flip permutations, the size of fix(c) is k°. 


Figure 6.2.1 The cycles in a vertex permutation. 


Every permutation can be written in cycle form: The permutation in figure 6.2.1, for 
example, is (1)(2,5)(3,4). A cycle in this context is a sequence (21, %o,...,Z%), Meaning 
that ¢(21) = r2, (x2) = x3, and so on until ¢(a,) = x,. Following our reasoning above, 
the vertices in each cycle must be colored the same color, and the total number of colors 
fixed by ¢ is k™, where m is the number of cycles. 

Let’s apply this to another permutation, shown in figure 6.2.2. This permutation 
consists of a single cycle, so every vertex must have the same color, and the number of 
colorings fixed by ¢ is k!. All rotations of the pentagon consist of a single cycle except the 
trivial rotation, that is, the identity permutation. In cycle form, the identity permutation 
is (1)(2)(3)(4)(5), so the number of colorings fixed by the identity is k°. Putting everything 
together, we thus have 


1 1 
JOl= VP tht kt k+ kt ko +k’ +k + kY +k) = (kh? + 5k° + 4k). 


For example, the number of different 3-colorings is (3° + 5-3? +4-3)/10 = 39. 


Figure 6.2.2 The permutation (1,3,5,2,4) is a single cycle. 
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EXAMPLE 6.2.6 We find the number of distinct colorings of the vertices of a square 
with k colors, modulo D4, the dihedral group of size 8. The elements of D4 are the four 
rotations ro, 790, 180, 7270, where r; is the counterclockwise rotation by 7 degrees, and the 


four reflections fx, fv, fp, fa, as indicated in figure 6.2.3. 


1 2 1. £2 sd 2 1 27 
4 3 4 1 3 4 3°. 4 3 


Figure 6.2.3. The reflection axes for fx, fv, fp, and fa. 


In cycle notation these permutations are: 


ro = (1)(2)(3)(4) 
foo = (14,3, 2) 
riso = (1,3)(2, 4) 
ra7o = (1, 2,3, 4) 
fi = (1, 4)(2, 3) 
fv = (1, 2)(3, 4) 
fo = (1)(2, 4)(3) 
fa = (1,3)(2)(4). 


so the number of colorings is 


F(R) = G(R RARE LRH? + A+ RS RS) =o (ht + DH + 3K? + D8), 


For example, f(2) = 6; the six colorings are shown in figure 6.2.4. 


LIL 


Figure 6.2.4 The six 2-colorings of the square. 


EXAMPLE 6.2.7 Here is a more complicated example: how many different graphs are 
there on four vertices? In this case, of course, “different” means “non-isomorphic”. We 
can interpret this as a coloring problem: Color the edges of the complete graph K4 with 
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two colors, say black and white. The black edges form a graph; the white edges are the 
ones left out of the graph. The group G we need to consider is all permutations of the six 
edges of K4 induced by a permutation of the vertices, so |G| = 4! = 24. All we need to 
know is the number of cycles in each permutation; we consider a number of cases. 
Case 1. The identity permutation on the vertices induces the identity permutation on the 
edges, with 6 cycles, so the contribution to the sum is 2°. 
Case 2. A 4-cycle on the vertices induces a permutation of the edges consisting of one 
4-cycle and one 2-cycle, that is, two cycles. There are 3! = 6 4-cycles on the vertices, so 
the contribution of all of these is 6 - 2?. 
Case 3. A permutation of the vertices consisting of a 3-cycle and a 1-cycle induces a 
permutation of the edges consisting of two 3-cycles. There are 4-2 = 8 such permutations 
of the vertices, so the contribution of all is 8 - 2?. 
Case 4. A permutation of the vertices consisting of two 2-cycles induces a permuta- 
tion of the edges consisting of two 1-cycles and two 2-cycles. There are 5 (3) = 3 such 
permutations, so the contribution is 3 - 2*. 
Case 5. A permutation of the vertices consisting of a 2-cycle and two 1-cycles induces a 
permutation of the edges consisting of two l-cycles and two 2-cycles. There are (3) = 
such permutations, so the contribution is 6 - 2+. 

The number of distinct colorings, that is, the number of distinct graphs on four vertices, 
- 1 1 

6 2 2 4 4 
a 2 +6-2°4+8-2°4+3-2°+6-2*)= 54 (284) = 11. 


It is possible, though a bit difficult, to see that for n vertices the result is 


my ln? |(n—1)/2] [n/2| 
, Bie ae 
f(n) = S- I] Gee I] gkian I] gkior+4 I] QkC (5x2) I] gecd(r,s8)jrJ (6.2.1) 
j k=l k=1 k=1 k=1 1<r<s<n—1 
where the sum is over all partitions j = (j1,J2,.--,jn) of n, that is, over all j such that 


ji + 2jo + 373 +--+ +nNjn =n, and C(m,2) = ae With this formula and a computer it 
is easy to compute the number of graphs when n is not too large; for example, f(5) = 34, 
so there are 34 different five-vertex graphs. 

In light of the forgoing discussion, we can restate theorem 6.2.5. If o is a permutation, 
let ##a0 denote the number of cycles when ¢@ is written in cycle form. 


COROLLARY 6.2.8 If group G acts on the colorings of an object, the number of 
distinct colorings modulo G with k colors is 
1 


iq eas 


o€G 
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Exercises 6.2. 


1. Write the 12 permutations of the vertices of the regular tetrahedron corresponding to the 12 
rigid motions of the regular tetrahedron in cycle form. Use the labeling below. 


2. Find the number of different colorings of the vertices of a regular tetrahedron with & colors, 
modulo the rigid motions. 

3. Write the 12 permutations of the edges of the regular tetrahedron corresponding to the 12 
rigid motions of the regular tetrahedron in cycle form. Use the labeling below. 


4. Find the number of different colorings of the edges of a regular tetrahedron with k colors, 
modulo the rigid motions. 
5. Find the number of non-isomorphic graphs on 5 vertices “by hand”, that is, using the method 


of example 6.2.7. 


6.3. POLYA-REDFIELD COUNTING 


Suppose we are interested in a more detailed inventory of the colorings of an object, 
namely, instead of the total number of colorings we seek the number of colorings with a 
given number of each color. The method presented here was first published by J. Howard 
Redfield in 1927. In 1937 it was independently rediscovered by George Pélya, who then 
greatly popularized the result by applying it to many counting problems, in particular to 


the enumeration of chemical compounds. 


EXAMPLE 6.3.1 How many distinct ways are there to color the vertices of a regular 
pentagon modulo Ds so that one vertex is red, two are blue, and two are green? 
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We can approach this as before, that is, the answer is 


1 
Dai Dy, Ch 
where fix(o) now means the colorings with one red, two blues, and two greens that are 
fixed by a. No longer can we use the simple expression of corollary 6.2.8. 

The identity permutation fixes all colorings, so we need to know how many colorings 
of the pentagon use one red, two blues, and two greens. This is an easy counting problem: 
the number is (3) (5) = 30. 

If o is a non-trivial rotation, | fix(a)| = 0, since the only colorings fixed by a rotation 
have all vertices the same color. 

If o is a reflection, the single vertex fixed by o must be red, and then the remaining 
2-cycles are colored blue and green in one of two ways, so | fix(a)| = 2. 

Thus, the number of distinct colorings is 


1 
pp OF OOF OF OF 2+ 2+24+24+2) = 4. 


What we seek is a way to streamline this process, since in general the computations 
of | fix(a)| can be tedious. We begin by recasting the formula of corollary 6.2.8. 


DEFINITION 6.3.2 The type of a permutation o € S, is T(7) = (71(0), T2(),---,; T()), 
where 7;(0) is the number of i-cycles in the cycle form of a. 


Note that >", ti(o) = #0. Now instead of the simple 


1 oO 
ia] 2” 


o€G 


let us write 1 
ral Se wee gtal?), 
If we substitute x; = k for every 7, we get the original form of the sum, but the new version 
carries more information about each o. 
Suppose we want to know the number of colorings fixed by some o that use 7 reds and 
j blues, where of course i+7 =n. Using ideas familiar from generating functions, consider 


the following expression: 
(r cs by) (r? re b2)72(9) oes (r” 4 pry), 


If we multiply out, we get a sum of terms of the form r?b2, each representing some particular 
way of coloring the vertices of cycles red and blue so that the total number of red vertices 
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is p and the number of blue vertices is g, and moreover this coloring will be fixed by o. 
When we collect like terms, the coefficient of r’b’ is the number of colorings fixed by o 
that use 7 reds and j blues. This means that the coefficient of r’b’ in 


RG 4 b)T 2) (r? 4 p2)72(9) _.. (r” a. pr)7n (2) 
oE€G 


> | ix(o)| 


o€G 
where fix(c) is the set of colorings using i reds and j blues that are fixed by c. Finally, 
then, the number of distinct colorings using 7 reds and 7 blues is this coefficient divided 
by |G|. This means that by multiplying out 


| 


id Si(r a: b) TH) (p? rs p2)72(7) roe (r” ic pr)tm(o) 


o€G 


and collecting like terms, we get a list of the number of distinct colorings using any combi- 
nation of reds and blues, each the coefficient of a different term; we call this the inventory 
of colorings. If we substitute r = 1 and b = 1, we get the sum of the coefficients, namely, 
the total number of distinct colorings with two colors. 


DEFINITION 6.3.3 The cycle index of G is 


_ Tilo) 


cE€Gi=l 


EXAMPLE 6.3.4 Consider again example 6.2.6, in which we found the number of 
colorings of a square with two colors. The cycle index of D4 is 


1 1 1 3 1 
g(t +agtazt+ay+a5+a5+ajx2 + 2j{22) = gti + rite + gt + 724 


Substituting as above gives 


1 i 1 
aes b)* + gee b)?(r? + 6?) + (7 + 07)? + a + b*) = r* + rb + 2r7b* + rb? +b. 


Thus there is one all red coloring, one with three reds and one blue, and so on, as shown 


in figure 6.2.4. 


There is nothing special about the use of two colors. If we want to use three colors, we 
substitute r’ + b' + g’ for 2; in the cycle index, and for k colors we substitute something 
like ci t+ch teh +---+c4. 
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EXAMPLE 6.3.5 Let’s do the number of 3-colorings of the square. Since we already 
have the cycle index, we need only substitute x; = r’+ b’ + g’ and expand. We get 


iL 1 3 il 
girtotg) + zr tot gy (+P te) tor +b ta?) + Fri +i +9") 
= b* + bg + b?r + 2b? 9? + 2b? gr + 2b?r? + bg? + 2hg?r + 2bgr?+ 
br? + g* + g?r + 2g2r? + gr? +17. 


So, for example, there are two squares with two blue vertices, one green, and one red, from 


the b? gr term. 


EXAMPLE 6.3.6 Consider again example 6.2.7, in which we counted the number of 
four-vertex graphs. Following that example, we get 


Pa (x9 + Gao, + 8x2 + 3a7a3 + 6aiz3), 


~ 24 
and substituting for the variables x; gives 
ro br bs Ore +3 + 2r- bo ee +6. 


Recall that the “colors” of the edges in this example are “included” and “excluded”. If we 
set b= 1 and r =i (for “included” ) we get 


4a +01 4 Br 427? 44 +1, 


interpreted as one graph with 6 edges, one with 5, two with 4, three with 3, two with 2, 
one with 1, and one with zero edges, since 1 = 7°. (Of course, if we are interested in the 
inventory in the final form, we can skip the step using r and b by substituting 7; = 7i/ +1 


in the cycle index.) 


It is possible, though a bit difficult, to see that for n vertices the cycle index is 


n l [n/2| [(n—1)/2] i [n/2| ens ae 
k-1) jon j jks gcd(r,s)irjs 
Pa=) Wagga LT orm’ TD eet TD ee TD ate 
7 Ral jeg k=1 b=1 psi 
where the sums are over all partitions j = (j1, jo,.--,jn) of n, that is, over all j such that 


ji + 292 +373 +--+ +nNjn =n, and C(m, 2) = ole This is where the formula 6.2.1 comes 
from, substituting x; = 2 for all 2. 

With this formula and a computer it is easy to compute the inventory of n-vertex 
graphs when n is not too large. When n = 5, the inventory is 


(oY og? 0? Ay ae Gi? hy? 4 ea OP ee 1, 


We can use Sage to do the computations involved in finding the cycle index for graphs. 
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Exercises 6.3. 


1. Find the cycle index Pg for the group of permutations of the vertices of a regular tetrahedron 
induced by the rigid motions. (See exercise 1 in section 6.2.) 


2. Using the previous exercise, write out a full inventory of colorings of the vertices of a regular 
tetrahedron induced by the rigid motions, with three colors, as in example 6.3.5. You may 
use Sage or some other computer algebra system. 


3. Find the cycle index Pg for the group of permutations of the edges of Ks. (See exercise 5 in 
section 6.2. Don’t use the general formula above.) 


4. Using the previous exercise, write out a full inventory of the graphs on five vertices, as in 
example 6.3.6. You may use Sage or some other computer algebra system. 


Hints 


1.6.2. Every positive integer can be writ- 
ten as j -2*, with j odd and k > 0. 
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: clique number, 119 
action closed neighborhood, 83 
group action on a set, 138 closed walk, 98, 129 
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addition principle, 12 complete graph, 10, 33, 95 
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B cut, 110 
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Bell triangle, 25 pom ’ 
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symmetry, 19 
bipartite graph, 83, 103 
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cycle form, 143 

cycle form of a permutation, 39 
cycle graph (C’,), 83 

cycle index, 148 


po cyclic group, 138 

block-cutpoint graph, 117 y roup, 

bridge, 106 

Brooks’s Theorem, 121 D 

C degree, 83 
maximum, 120 

Catalan numbers, 16, 66 minimum, 111 

chain, 36 derangement, 48 


Chinese Remainder Theorem, 32 derangement numbers, 49 


chromatic number, 119 digraph, 129 


chromatic polynomial, 125 connected, 134 
circuit, 98, 129 strongly connected, 134 


class, 120 underlying graph, 134 
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dihedral group, 138 
directed edge (arc), 129 
directed graph (digraph), 129 


E 


endblock, 117 

equivalence relation, 95 

Euler circuit, 98 

Euler walk, 99 

exponential generating function, 57 


F 


falling factorial, 42 
Ferrers diagram, 61 
flow, 129 

value of, 131 
forest, 105 


G 


Galton board, 21 
general graph, 91 
generating function, 53 
exponential, 57 
girth, 119 

graph 

directed, 129 
weighted, 108 
graphical parameters, 119 
graphical sequence, 93 
greedy algorithm, 120 
group, 138 

cyclic, 138 

dihedral, 138 
symmetric, 138 


H 


Hall’s Condition, 72 
Hall’s Theorem, 72 
Hamilton cycle, 100 
Hamilton path, 100 

in digraph, 134 
handle, 115 
Handle Theorem, The, 115 


I 


incident, 9, 83 
indegree, 129 
independence number, 119 


independent set, 118 
induced subgraph, 95 
internally disjoint, 112 
inventory, 148 
isomorphic, 94 
isomorphism, 94 
isotopic, 78 

isotopy class, 78 


J 
Jarnik Algorithm, 108 


K 


Kronecker delta, 40 
Kruskal’s Algorithm, 109 


L 


Latin square, 76 
isotopic, 78 
isotopy class, 78 
orthogonol, 79 
reduced, 77 

least cost spanning tree, 108 

length, 83 

loop, 83, 91 


M 


matching, 84 

perfect, 105 

maximum degree, 120 
maximum flow, 131 
minimum degree, 111 
modulo 

colorings modulo G, 139 
multigraph, 83, 91 
multinomial coefficient, 29 
multiple edges, 91 
multiplication principle, 12 
multiset, 27 

multitree, 108 


N 


neighborhood, 83 
closed, 83 
open, 83 

network, 129 

n-set, 14 


O 


open neighborhood, 83 
orbit, 139 

Ore property, 102 
outdegree, 129 


P 


partition 

conjugate, 61 
non-crossing, 69 

of a set, 22, 69 

of an integer, 59 
self-conjugate, 62 
Pascal’s Triangle, 14 
monotonicity, 19 
symmetry, 19 
path, 838, 95 

in digraph, 129 
pendant, 105 
pendant vertex, 94 
perfect matching, 105 
permutation, 36 
cycle form, 39 
permutations, 13 
Petersen graph, 103 
pigeonhole principle, 31 
planar, 125 

Prim’s Algorithm, 109 
proper coloring, 118 


Q 


quasigroup, 78 


R 


Ramsey number, 34 
Ramsey Theory, 34 


recurrence relation, 22, 50, 62 


regular graph, 105 
repetition number, 27 
rising factorial, 42 
rooted binary tree, 66 


S 


SDR, 71 
separating set, 113 
sequence, 92 

set system, 71 
simple graph, 91 
source, 129 


spanning tree, 106 
least cost, 108 
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Stirling numbers of the second kind, 26, 39 


subgraph, 94 
symmetric group, 138 


system of distinct representatives, 71 


T 

target, 129 

tree, 105 

rooted binary, 66 
type, 147 

V 


value of a flow, 131 


4 


walk, 97 
in digraph, 129 
weighted graph, 108 


